Time-series

python - Pandas 的时间加权平均值

在Pandas0.8中计算TimeSeries的时间加权平均值的最有效方法是什么？例如，假设我想要df.y-df.x的时间加权平均值，如下所示:importpandasimportnumpyasnptimes=np.datetime64('2012-05-3114:00')+np.timedelta64(1,'ms')*np.cumsum(10**3*np.random.exponential(size=10**6))x=np.random.normal(size=10**6)y=np.random.normal(size=10**6)df=pandas.DataFrame({'x':

python - 更改 Pandas Dataframe 中的时间频率

我有一个如下所示的PandasDataFrame。dfABdate_time2014-07-0106:03:59.61400062.1250NaN2014-07-0106:03:59.69200062.2500NaN2014-07-0106:13:34.52400062.2500241.06252014-07-0106:13:34.60200062.2500241.50002014-07-0106:15:05.39900062.2500241.37502014-07-0106:15:05.39900062.2500241.25002014-07-0106:15:42.00400062.

Dataframe python 2014 NaN code pandas time-series time-frequency

python - time.sleep 对处理器有帮助吗？

最近我在StackOverflow(Python)上冲浪，看到了thispostAaronHall声称constantlyrunningwhileloopscanconsumealotofprocessingpower.Addingasleepperiod(evenonlyasecond)cangreatlyreducethatusage.这是真的吗？如果是这样，怎么会呢？同样的规则是否也适用于其他编程语言(即C++)？最佳答案 TL;DR如果您正在轮询每分钟发生一次的事件，您可能不希望每纳秒检查一次。是的，这是真的。在线程中休眠

python sleep 纳秒 section expression c++

python pandas 按一天中的小时求和

我正在使用以下每小时计数(df)的数据集:datframe有8784行(2016年，每小时)。我想看看是否有每日趋势(例如，早上时间是否有所增加。为此，我想创建一个具有一天中的小时(从0到24)的图x轴和y轴上的骑车人数量(类似于下图中来自http://ofdataandscience.blogspot.co.uk/2013/03/capital-bikeshare-time-series-clustering.html的图片)。我尝试了pivot、resample和set_index的不同方法，并使用matplotlib绘制它，但没有成功。换句话说，我找不到一种方法来总结特定时间的每

求和 python code 39 pandas matplotlib time-series

python - 将 JSON 读取到 pandas 数据框 - ValueError : Mixing dicts with non-Series may lead to ambiguous ordering

我试图将下面的JSON结构读入pandas数据框，但它抛出了错误消息:ValueError:Mixingdictswithnon-Seriesmayleadtoambiguousordering.Json数据:{"status":{"statuscode":200,"statusmessage":"EverythingOK"},"result":[{"id":22,"club_id":16182},{"id":23,"club_id":16182},{"id":24,"club_id":16182},{"id":25,"club_id":16182},{"id":26,"club_id

ValueError non-Series 34 section club_id python json pandas

python - Matplotlib 删除缺失数据的插值

我正在使用Matplotlib绘制时间序列数据，但序列中缺少一些数据。Matplotlib隐式地将最后一个连续数据点加入到下一个数据点。但如果数据丢失，情节看起来很难看。以下是获得的情节。可以看出，在4月30日标记附近，数据缺失，Matplotlib加入点。下图也是数据的散点图。散点图掩盖了这个错误，但在这种情况下，相邻的数据点不会联合。此外，鉴于涉及大量数据点，散点图非常慢。此类问题的推荐解决方案是什么。最佳答案如果你能确定断点应该在哪里，你可以:分解数据并手工绘制每个“部分”在空白处的数据中插入np.nan参见示例Plotp

缺失 Matplotlib section code python time-series scatter-plot

python - 如何转换 HH :MM:SS string to UNIX epoch time?

我有一个程序(sarcommandlineutility)，它输出带有时间列的行。我用我的python脚本解析这个文件，我想将sar的02:31:33PM转换成纪元，例如1377181906(当前年、月、日以及来自上述字符串的小时、分钟和秒)。这怎么能以不那么麻烦的方式完成呢？我试着自己做这件事，但坚持使用时间/日期时间和他们的方法群。最佳答案这是一种方法:使用strptime将字符串读入日期时间通过replace将日期时间对象的年月日设置为当前日期的年月日通过calendar.timegm将日期时间转换成unix时间戳>>>f

python string code datetime section time

python Pandas : drop rows of a timeserie based on time range

我有以下时间序列:start=pd.to_datetime('2016-1-1')end=pd.to_datetime('2016-1-15')rng=pd.date_range(start,end,freq='2h')df=pd.DataFrame({'timestamp':rng,'values':np.random.randint(0,100,len(rng))})df=df.set_index(['timestamp'])我想删除这两个时间戳之间的行:start_remove=pd.to_datetime('2016-1-4')end_remove=pd.to_datetime

timeserie python code remove section pandas

python - 类型错误 : cannot convert the series to <class 'float' >

我有一个数据框(df)，如下所示:dateA2001-01-021.00222001-01-031.10332001-01-041.14962001-01-051.10332015-03-30126.37002015-03-31124.43002015-04-01124.25002015-04-02124.8900对于整个时间序列，我尝试将今天的值除以昨天的值并使用以下内容记录结果:df["B"]=math.log(df["A"]/df["A"].shift(1))但是我得到以下错误:TypeError:cannotconverttheseriesto我该如何解决这个问题？我尝试使用以

amp convert section code pre python python-3.x

python pandas.Series.isin 不区分大小写

我想用数据在列表中的DataFrame的列之一过滤掉一些行。df[df['column'].isin(mylist)]但是我发现它是区分大小写的。有没有使用不区分大小写的“.isin()”的方法？最佳答案一种方法是比较系列的小写或大写与列表的相同df[df['column'].str.lower().isin([x.lower()forxinmylist])]这里的优点是我们不保存对原始df或列表的任何更改，从而使操作更加高效考虑这个虚拟df:ColorVal0Green11Green12Red23Red24Blue35Blue

python pandas code section pre series

80 81 828384 85 86