sample_site

hadoop - $SPARK_HOME 中包含的 hive-site.xml 是什么样的？

我是hive的初学者，当我启动spark作业并从hive读取数据时发生了一些事情(找不到表)。我没有在$SPARK_HOME/conf中设置hive-site.xml吗？提交spark作业命令在这里bin/spark-submit--masterlocal[*]--driver-memory8g--executor-memory8g--classcom.ctrip.ml.client.Client/root/GitLab/di-ml-tool/target/di-ml-tool-1.0-SNAPSHOT.jar 最佳答案在您的Hi

中包 SPARK_HOME section hive AdminManualConfiguration-hive-sit hadoop apache-spark

java - Spark 作业失败，因为它找不到 hadoop core-site.xml

我正在尝试运行spark作业，但在尝试启动驱动程序时出现此错误:16/05/1714:21:42ERRORSparkContext:ErrorinitializingSparkContext.java.io.FileNotFoundException:Addedfilefile:/var/lib/mesos/slave/slaves/0c080f97-9ef5-48a6-9e11-cf556dfab9e3-S1/frameworks/5c37bb33-20a8-4c64-8371-416312d810da-0002/executors/driver-20160517142123-018

core-site hadoop spark SparkContext SparkSubmit java apache-spark mesos

hadoop - 在没有 mapred-site.xml 的情况下设置 hadoop mapreduce 大小

我在服务器上运行mapreduce作业，但不断收到此错误:Containerkilledonrequest.Exitcodeis143Containerexitedwithanon-zeroexitcode143Containerisrunningbeyondphysicalmemorylimits.Currentusage:1.0GBof1GBphysicalmemoryused;2.7GBof2.1GBvirtualmemoryused.Killingcontainer.当然我已经阅读了所有可能的资源并且我知道我需要在这些文件中设置配置:mapred-site.xml\\yar

hadoop mapred-site memory section mapreduce containers

hadoop - Hive 没有完全遵守 core-site.xml 中的 fs.default.name/fs.defaultFS 值

我在一台名为hadoop的机器上安装了NameNode服务。core-site.xml文件的fs.defaultFS(等同于fs.default.name)设置如下:fs.defaultFShdfs://hadoop:8020我有一个名为test_table的非常简单的表，它当前存在于HDFS上的Hive服务器中。即存放在/user/hive/warehouse/test_table下。它是在Hive中使用一个非常简单的命令创建的:CREATETABLEnew_table(record_idINT);如果我尝试将数据加载到本地表中(即使用LOADDATALOCAL)，一切都会按预期进行

core-site defaultFS code test_table hive hadoop cloudera

hadoop - hadoop.tmp.dir 到底应该设置在哪里？ core-site.xml 还是 hdfs-site.xml？

我问的是Hadoop2.x系列。互联网上对此有相互矛盾的建议。喜欢这个case他要求在core-site.xml和这个SOanswer中指定它其中提到hadoop.tmp.dir设置在hdfs-site.xml中。应该放在哪个位置？最佳答案 hadoop.tmp.dir(Abaseforothertemporarydirectories)属性，需要在core-site.xml中设置，就像exportinLinux例如:dfs.namenode.name.dirfile://${hadoop.tmp.dir}/dfs/name您可以

hadoop site code section configuration hdfs configuration-files

Hadoop:无法在 core-site.xml 中将默认文件系统设置为 HDFS

我在伪分布式模式下使用Hadoop1.0.3。而我的conf/core-site.xml设置如下:fs.default.namehdfs://localhost:9000mapred.child.tmp/home/administrator/hadoop/temp所以我认为我的默认文件系统设置为HDFS。但是，当我运行以下代码时:Configurationconf=newConfiguration();FileSystemfs=FileSystem.get(conf);我认为fs应该是一个DistributedFileSystem实例。然而，结果却是LocalFileSystem实例。

core-site Hadoop code section filesystems hdfs

hadoop - mapred-site.xml 中 mapreduce.framework.name 的经典、本地有什么区别？

官方对该参数的描述如下:TheruntimeframeworkforexecutingMapReducejobs.Canbeoneoflocal,classicoryarn.我知道值“yarn”用于MRv2，它会将mapreduce作业提交给resourcemanager。但是本地和经典之间有什么区别？哪个对应MRv1？非常感谢! 最佳答案你是对的，“yarn”代表MRv2。“经典”用于MRv1，“本地”用于MR作业的本地运行。但是为什么需要MRv1？Yarn现在已经结束测试，它比旧的MRv1框架更稳定，而您的MapReduce作

mapred-site framework section MRv MRv1 hadoop mapreduce hadoop-yarn

java - 如何在类路径中包含 hbase-site.xml

我目前正在尝试让我的HBase代码使用我的hbase-site.xml中指定的设置。它似乎使用默认设置而不是hbase-site.xml配置文件中指定的设置。更新文件后我重新启动了HBase集群，但它仍然没有使用我更新的配置文件。我使用的集群是2个节点，其中一个是master。两个节点上的配置文件都将主节点的IP指定为zookeeperquorum。我认为问题是我在hbase-site.xml中指定的设置没有被使用，因为如果我通过代码将zookeeperquorum设置为与我的hbase-site.xml中相同的值，代码运行正常，但第二个节点不能如果未通过代码指定法定人数，请联系mas

中包何在 hbase-site hbase section java hadoop classpath

hadoop - Hive 是否可以在不分区或不编辑 hive-site.xml 的情况下递归下降到子目录？

我有一些Web服务器日志，我想用Hive查询。HDFS中的目录结构如下所示:/data/access/web1/2014/09/data/access/web1/2014/09/access-20140901.log[...etc...]/data/access/web1/2014/10/data/access/web1/2014/10/access-20141001.log[...etc...]/data/access/web2/2014/09/data/access/web2/2014/09/access-20140901.log[...etc...]/data/access/we

子目子目录 access 34 STRING hadoop mapreduce hive

apache-spark - 如何在没有 hive-site.xml 的情况下将 Spark SQL 连接到远程 Hive Metastore(通过节俭协议(protocol))？

我将HiveContext与SparkSQL一起使用，并且我正在尝试连接到远程Hive元存储，设置Hive元存储的唯一方法是在类路径中包含hive-site.xml(或将其复制到/etc/spark/conf/).有没有办法在不包含hive-site.xml的情况下以编程方式在java代码中设置此参数？如果是这样，要使用什么Spark配置？最佳答案对于Spark1.x，您可以设置:System.setProperty("hive.metastore.uris","thrift://METASTORE:9083");finalSp

节俭何在 section 34 HiveContext apache-spark hive apache-spark-sql

6 7 8910 11 12