val temp = sqlContext.sql(s"SELECT A, B, C, (CASE WHEN (D) in (1,2,3) THEN ((E) + 0.000)/60 ELSE 0 END) AS Z from TEST.TEST_TABLE")
val temp1 = temp.map({ temp => ((temp.getShort(0), temp.getString(1)), (USAGE_TEMP.getDouble(2), USAGE_TEMP.getDouble(3))) }).reduceByKey((x, y) => ((x._1 + y._1), (x._2 + y._2)))
I want to do the transformation in Scala, rather than in Hive as above
I am new to Scala, and I want to iterate over a dataset with three loops and perform some analysis. For example, my data looks like this:
Sample.csv
1,100,0,NA,0,1,0,Friday,1,5
1,100,0,NA,0,1,0,Wednesday,1,9
1,100,1,NA,0,1,0,Friday,1,5
1,100,2,NA,0,1,0,Friday,1,5
1,101,0,NA,0,1,0,Friday,1,5
1,101,1,NA,0,1,0,Friday,1,5
1,101,2,NA,0,1,0,Friday,1,5
1,102,0,NA,0,1,0,Friday,1,5
1,102,1,NA,0,1,0,Friday,1
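I have run into a problem with Apache Spark while using Scala. I am trying to create a Spark application that prints an RDD based on user input. The input data looks like this:
List("aaaa", "aaaa", "dfddf", "aaaa", "aaaa", "dfddf", "aaaa", "aaaa", "dfddf", "aaaa", "aaaa", "dfddf", "aaaa", "aaaa", "dfddf")
The code looks like this:
val wSchemaString = "col1 col2 col3 col4";
val wSchema = StructType(wSchemaString.split(" ").map(fieldName => Struc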