SKLEARN_草庐IT

python - iPython:无法导入名为 sklearn 的模块

我可以使用python解释器导入sklearn，但是当我尝试在iPythonnotebook中执行相同操作时，iPython会抛出ImportError。知道是什么导致了这个问题吗？我需要在iPython中使用一个模块。我不确定这是否有用，但这是我在我的机器上安装的软件包的一个子集。我按照此处有关安装过程的说明进行操作:http://shanshanchen.com/2013/05/29/install-numpy-scipy-scikit-learn-on-mac-os-x-for-data-miners/ 最佳答案问题可以通过

名为 iPython section image python python-import jupyter-notebook

python - sklearn.linear_model.LogisticRegression 每次都返回不同的系数，尽管设置了 random_state

我正在拟合逻辑回归模型并将随机状态设置为固定值。每次我进行“拟合”时，我都会得到不同的系数，例如:classifier_instance.fit(train_examples_features,train_examples_labels)LogisticRegression(C=1.0,class_weight=None,dual=False,fit_intercept=True,intercept_scaling=1,penalty='l2',random_state=1,tol=0.0001)>>>classifier_instance.raw_coef_array([[0.071

LogisticRegression linear_model 0.07110193825129411394 0.071101940040772596 07110193825129411394 python scikit-learn logistic-regression

python - 如何从 sklearn 的 CCA 模块中获得第一个典型相关性？

在适用于Python的scikit-learn中，有一个模块调用cross_decomposition，其中包含典型相关分析(CCA)类。我一直在试图弄清楚如何给出形状为(n,m)的第2类多维向量并获得第一个典型相关系数。查看文档，一个小示例脚本如下。fromsklearn.cross_decompositionimportCCAimportnumpyasnpU=np.random.random_sample(500).reshape(100,5)V=np.random.random_sample(500).reshape(100,5)cca=CCA(n_components=1)cc

典型 sklearn code section python scikit-learn

python - 堆叠 RBM 以在 sklearn 中创建深度信念网络

根据这个website，深度信念网络只是将多个RBM堆叠在一起，使用前一个RBM的输出作为下一个RBM的输入。在scikit-learn中documentation，有一个使用RBM对MNIST数据集进行分类的示例。他们将RBM和LogisticRegression放入管道中以实现更高的准确性。因此我想知道是否可以将多个RBM添加到该管道中以创建深度信念网络，如以下代码所示。fromsklearn.neural_networkimportBernoulliRBMimportnumpyasnpfromsklearnimportlinear_model,datasets,metricsfr

中创堆叠 logistic RBM train python machine-learning scikit-learn deep-learning

python - 使用 SKLearn 构建模型时忽略列

使用R，可以在使用以下语法构建模型时忽略变量(列):model=lm(dependant.variable~.-ignored.variable,data=my.training,set)当您的数据集包含索引或ID时，它会非常方便。假设您的数据是Pandas数据帧，您将如何使用Python中的SKlearn来做到这一点？最佳答案所以这是我自己的代码，我去年在StackOverflow上做了一些预测:from__future__importdivisionfrompandasimport*fromsklearnimportcros

SKLearn python section basic_feature_names 39 scikit-learn

python - 在 sklearn 管道中合并的多个管道？

有时我设计的机器学习管道看起来像这样:通常我必须使用我自己的“组合功能”功能将这些“拆分”管道组合在一起。但是，如果我能将其放入sklearnPipeline对象中，那就太好了。我该怎么做呢？(伪代码很好。) 最佳答案只要“整个数据集”表示相同的特征，这就是FeatureUnion做:make_pipeline(make_union(PolynomialFeatures(),PCA()),RFE(RandomForestClassifier()))如果你有两组不同的特征要合并，你首先需要将它们放入一个数据集中，然后让Feature

sklearn python section code noreferrer machine-learning scikit-learn

python - 带有sklearn的python中的轮廓系数

我在使用sklearn计算python中的轮廓系数时遇到了问题。这是我的代码:fromsklearnimportdatasetsfromsklearn.metricsimport*iris=datasets.load_iris()X=pd.DataFrame(iris.data,columns=col)y=pd.DataFrame(iris.target,columns=['cluster'])s=silhouette_score(X,y,metric='euclidean',sample_size=int(50))我得到错误:IndexError:indicesareout-of-b

python sklearn indices silhouette_score silhouette cluster-analysis scikit-learn

python - sklearn 上的套索不收敛

当我运行类似的东西时importnumpyfromsklearnimportlinear_modelA=#somethingb=#somethingclf=linear_model.Lasso(alpha=0.015,fit_intercept=False,tol=0.00000000000001,max_iter=10000000000000,positive=True)clf.fit(A,b)我得到错误:usr/local/lib/python2.7/dist-packages/scikit_learn-0.14.1-py2.7-linux-x86_64.egg/sklearn/l

sklearn python section linear_model machine-learning least-squares scikit-learn

python - 如何输出sklearn standardscaler

我使用preprocessing.standardscaler在sklearn中标准化了我的数据。问题是如何将其保存在我的本地以备后用？谢谢最佳答案如果我对你的理解是正确的，你想保存你训练好的模型以便它可以再次加载正确吗？有两种方法，一种是使用python的pickle，一种是使用joblib。推荐的方法是joblib，因为这会产生比pickle小得多的文件，pickle会转储对象的字符串表示形式:fromsklearn.externalsimportjoblibjoblib.dump(clf,'filename.pkl')#t

standardscaler sklearn code section joblib python scikit-learn

python - 保留(pickle)自定义 sklearn 管道的推荐方法是什么？

我构建了一个sklearnpipeline它结合了标准支持向量回归组件和一些创建特征的自定义转换器。然后将此管道放入一个经过训练然后pickle的对象中(thisseemstobetherecommendedway)。unpickled对象用于进行预测。为了分发，这被转换为带有pyinstaller的可执行文件.当我从单元测试中调用unpickled回归对象时，它工作正常。但是，当我尝试使用PyInstaller二进制文件进行预测时，我得到一个很长的stacktrace结束于:module=loader.load_module(fullname)File"messagestream.p

自定 sklearn section code python scikit-learn pickle pyinstaller