pyhton_Pandas

python - 在 Pandas 中，如何根据值的类型过滤系列？

给定一个系列importpandasaspds=pd.Series(['foo','bar',42])我想获得一个“子系列”pd.Series(['foo','bar'])，其中所有值都是字符串。我试过像这样的bool索引:s[isinstance(s,str)]但这给出了一个KeyError:False到目前为止，在我寻找合适方法的过程中，我遇到了select，但这对标签强加了一个标准，而不是值。在这种情况下，如何根据值(的类型)进行过滤？最佳答案使用apply或列表理解:s[s.apply(lambdax:isinstanc

python - Pandas 多索引 : Divide all columns by one column

我有一个数据框results的形式TOTEXPPQTOTEXPCQFINLWT21yearquarter1319.183392e+095.459961e+091271559.39822.907887e+091.834126e+09481169.672我试图将所有(前两列)除以最后一列。我的尝试是weights=results.pop('FINLWT21')results/weights但是我明白了ValueError:cannotjoinwithnolevelspecifiedandnooverlappingnames我不明白:索引中有重叠的名称:weights.head()yearq

多索 columns code section pre python pandas

python - 从一个函数在 Pandas Dataframe 中创建多列

我是一个Python新手，所以我希望我的两个问题是清楚和完整的。我在下面以csv格式发布了实际代码和测试数据集。我已经能够构建以下代码(主要是在StackOverflow贡献者的帮助下)来使用Newton-Raphson方法计算期权合约的隐含波动率。该过程在确定隐含波动率时计算Vega。虽然我可以使用PandasDataFrameapply方法为隐含波动率创建一个新的DataFrame列，但我无法为Vega创建第二个列。当函数同时返回IV和Vega时，有没有办法创建两个单独的DataFrame列？我试过:从函数返回iv,vegadf[['myIV','Vega']]=df.apply(

中创多列 39 code 00 python pandas multiple-columns jit numba

python - 在 Pandas 中将列拆分为多行的快速方法

我有以下数据框:importpandasaspddf=pd.DataFrame({'gene':["foo","bar//lal","qux","woz"],'cell1':[5,9,1,7],'cell2':[12,90,13,87]})df=df[["gene","cell1","cell2"]]df看起来像这样:Out[6]:genecell1cell20foo5121bar//lal9902qux1133woz787我想做的是拆分“基因”列，使其结果如下:genecell1cell2foo512bar990lal990qux113woz787我目前的做法是这样的:importp

多行 python 34 cell code pandas

python - 在 Pandas DataFrame 的字符串中漂亮地打印换行符

我有一个PandasDataFrame，其中一列包含字符串元素，这些字符串元素包含我想按字面打印的新行。但它们只是在输出中显示为\n。也就是说，我要打印这个:posbidder011223但这就是我得到的:posbidder011223我怎样才能完成我想要的？我可以使用DataFrame，还是必须恢复为一次一行手动打印填充列？这是我目前所拥有的:n=4output=pd.DataFrame({'pos':range(1,n+1),'bidder':['']*n})bids={'alice':3,'bob':3}used_pos=[]forbidder,posinbids.items()

换行符 DataFrame bidder section code python string python-3.x pandas printing

python - 有包含列表的 Pandas 列，如何将唯一的列表元素转换为列？

我编写了一个网络抓取工具来从产品表中提取信息并构建数据框。数据表有一个描述列，其中包含描述产品的逗号分隔的属性字符串。我想在数据框中为每个唯一属性创建一个列，并用该属性的子字符串填充该列中的行。下面的示例df。PRODUCTSDATEDESCRIPTIONProductA2016-9-12Steel,Red,HighHardnessProductB2016-9-11Blue,Lightweight,SteelProductC2016-9-12Red我认为第一步是将描述分成一个列表。In:df2=df['DESCRIPTION'].str.split(',')Out:DESCRIPTIO

python Pandas 39 Product 2016 numpy dataframe pivot

python - 在 Pandas 中设置多列索引

我像这样制作数据框。df=pd.DataFrame({'class':['A','A','A','A','A','B','B','B','B','B'],'number':[1,2,3,4,5,1,2,3,4,5],'math':[90,20,50,30,57,67,89,79,45,23],'english':[40,21,68,89,90,87,89,54,21,23]})我想使用一些pandas方法将索引转换为此。(例如set_index、堆栈、、、)df1=pd.DataFrame(np.random.randint(1,100,(5,4)),columns=[['A','A'

中设多列 39 pandas code python numpy

python - 将 Pandas 列中的日期时间格式化为季度

我有一个DataFrame，其中包含DateTime索引中的列，代表季度，例如:2000-03-3100:00:00如何将其转换为'2000q1'？我查看了文档，但他们只提到了DateTimeIndex.quarterformat='%Y%q'不起作用。选项on='%Y%q也没有最佳答案您可以使用to_period("Q"):df.index=df.index.to_period("Q")importpandasaspddf=pd.DataFrame({"y":[1,2,3]},index=pd.to_datetime(["20

化为 python code 2000 section pandas datetime format date-formatting

Python Pandas 添加文件名列 CSV

我的python代码在下面的例子中工作正常。我的代码结合了一个CSV文件目录并匹配标题。但是，我想更进一步-如何添加一个列来附加所用CSV的文件名？importpandasaspdimportglobglobbed_files=glob.glob("*.csv")#createsalistofallcsvfilesdata=[]#pd.concattakesalistofdataframesasanagrumentforcsvinglobbed_files:frame=pd.read_csv(csv)data.append(frame)bigframe=pd.concat(data,i

名列 Python code section csv pandas dataframe glob

python - reshape Pandas 数据框

假设有这样一个数据框:df=pd.DataFrame([[1,2,3,4],[5,6,7,8],[9,10,11,12]],columns=['A','B','A1','B1'])我想要一个如下所示的数据框:什么不起作用:new_rows=int(df.shape[1]/2)*df.shape[0]new_cols=2df.values.reshape(new_rows,new_cols,order='F')当然我可以遍历数据并制作一个新的列表列表，但必须有更好的方法。有什么想法吗？最佳答案您可以使用lreshape,对于id列

reshape python code noreferrer section pandas dataframe lreshape

91 92 939495 96 97