我是scala的新手,我想为一个数据集迭代三个循环并执行一些分析。例如我的数据如下:Sample.csv1,100,0,NA,0,1,0,Friday,1,51,100,0,NA,0,1,0,Wednesday,1,91,100,1,NA,0,1,0,Friday,1,51,100,2,NA,0,1,0,Friday,1,51,101,0,NA,0,1,0,Friday,1,51,101,1,NA,0,1,0,Friday,1,51,101,2,NA,0,1,0,Friday,1,51,102,0,NA,0,1,0,Friday,1,51,102,1,NA,0,1,0,Friday,1
我在使用Scala时遇到ApacheSpark的一个问题。我正在尝试创建一个Spark应用程序,它根据用户输入打印一个RDD。输入数据是这样的:List("aaaa","aaaa","dfddf","aaaa","aaaa","dfddf","aaaa","aaaa","dfddf","aaaa","aaaa","dfddf","aaaa","aaaa","dfddf")代码是这样的:valwSchemaString="col1col2col3col4";valwSchema=StructType(wSchemaString.split("").map(fieldName=>Struc
我有一个数据框df,其中有两列是这样的。+-----+------------------+|x|y|+-----+------------------+|0.0|{12,16,17,18,19}||0.0|{18,16,17,18,19}||0.0|{15,16,67,18,19}||0.0|{65,16,17,18,19}||0.0|{9,16,17,18,19}||1.0|{12,16,17,28,39}||0.0|{24,16,17,28,19}||0.0|{90,16,17,18,29}||1.0|{30,16,17,18,19}||1.0|{28,16,17,18,19}|