如何在 Python 中使用 scikit-learn 库加载数据？

Scikit-learn，通常称为 sklearn，是 Python 中一个开源库，用于实现机器学习算法。

这包括分类、回归、聚类、降维等等，借助于 Python 中强大且稳定的接口。该库建立在 Numpy、SciPy 和 Matplotlib 库之上。

让我们看一个加载数据的示例：

示例

from sklearn.datasets import load_iris
my_data = load_iris()
X = my_data.data
y = my_data.target
feature_name = my_data.feature_names
target_name = my_data.target_names
print("Feature names are : ", feature_name)
print("Target names are : ", target_name)
print("\nFirst 8 rows of the dataset are : \n", X[:8])

输出

Feature names are : ['sepal length (cm)', 'sepal width (cm)', 'petal length (cm)', 'petal width (cm)']
Target names are : ['setosa' 'versicolor' 'virginica']
First 8 rows of the dataset are :
[[5.1 3.5 1.4 0.2]
[4.9 3. 1.4 0.2]
[4.7 3.2 1.3 0.2]
[4.6 3.1 1.5 0.2]
[5. 3.6 1.4 0.2]
[5.4 3.9 1.7 0.4]
[4.6 3.4 1.4 0.3]
[5. 3.4 1.5 0.2]]

Learn Python in-depth with real-world projects through our Python certification course. Enroll and become a certified expert to boost your career.

解释

导入所需的包。
将此所需的数据集加载到环境中。
从数据集中分离特征和目标值。
在控制台上打印这些特征和目标。
此外，要查看数据的样本，会在控制台上打印数据的前 8 行。

AmitDiwan

更新于： 2020-12-11

339 次浏览

开启你的职业生涯

通过完成课程获得认证

开始学习

如何在 Python 中使用 scikit-learn 库加载数据？

示例

输出

解释

开启你的 职业生涯

开启你的职业生涯