网站首页 > 博客 > 正文

【深耕 Python】Data Science with Python 数据科学（8）pandas数据结构：Series和DataFrame

江湖小白博客 2024-04-11 1 0

写在前面

关于数据科学环境的建立，可以参考我的博客：

【深耕 Python】Data Science with Python 数据科学（1）环境搭建

往期数据科学博文：

【深耕 Python】Data Science with Python 数据科学（2）jupyter-lab和numpy数组

【深耕 Python】Data Science with Python 数据科学（3）Numpy 常量、函数和线性空间

【深耕 Python】Data Science with Python 数据科学（4）（书337页）练习题及解答

【深耕 Python】Data Science with Python 数据科学（5）Matplotlib可视化（1）

【深耕 Python】Data Science with Python 数据科学（6）Matplotlib可视化（2）

【深耕 Python】Data Science with Python 数据科学（7）书352页练习题

代码说明：由于实机运行的原因，可能省略了某些导入（import）语句。

Jupyter 代码片段 1：定义简单的Series

import numpy as np

import pandas as pd

import matplotlib.pyplot as plt

print(pd.Series([1, 2, 3, "foo", np.nan, "bar"]))

print()

print(pd.Series([1, 2, 3, "foo", np.nan, "bar"]).dropna())

运行结果：

Jupyter 代码片段 2：Series的索引、自定义索引

Series的索引支持自定义，可以通过索引访问各个成员、进行切片操作等。

from numpy.random import default_rng

print(pd.Series([1, 2, 3, "foo", np.nan, "bar"]).index)

rng = default_rng()

print()

s = pd.Series(rng.standard_normal(5), index=["a", "b", "c", "d", "e"])

print(s)

print()

print(s[0])

print()

print(s[1:3])

print()

print(s["c"])

print()

print(s.keys())

print()

print(s.index)

运行结果：

Jupyter 代码片段 3：简单直方图的绘制

使用1000个标准正态分布的样本点，绘制直方图：

s = pd.Series(rng.standard_normal(1000))

s.hist()

plt.show()

运行结果：

Jupyter 代码片段 4：DataFrame的构造、访问和映射

from math import tau

from numpy.random import default_rng

rng = default_rng()

df = pd.DataFrame(

{

"Number": 1.0,

"String": "foo",

"Angles": np.linspace(0, tau, 5),

"Random": pd.Series(rng.standard_normal(5)),

"Timestamp": pd.Timestamp("20221020"),

"Size": pd.Categorical(["tiny", "small", "mid", "big", "huge"])

}

)

print(df)

print()

print(df["Size"])

print()

print(df["Random"].mean())

print()

print(df.describe())

print()

sizes = {"tiny": 4, "small": 8, "mid": 12, "big": 16, "huge": 24}

df["Size"].map(sizes)

运行结果：

参考文献 Reference

《Learn Enough Python to be Dangerous——Software Development, Flask Web Apps, and Beginning Data Science with Python》, Michael Hartl, Boston, Pearson, 2023.

精彩文章

评论可见，请评论后查看内容，谢谢！！！评论后请刷新页面。

本文由用户于 2024-04-11 发布在夸智网，如有疑问，请联系我们。
本文链接：https://www.kuazhi.com/post/713733686.html

夸智网

【深耕 Python】Data Science with Python 数据科学（8）pandas数据结构：Series和DataFrame

人工AI绘画特效怎么做

与AI绘画有关的标题怎么写

发表评论取消回复

夸智网

【深耕 Python】Data Science with Python 数据科学（8）pandas数据结构：Series和DataFrame

人工AI绘画特效怎么做

与AI绘画有关的标题怎么写

相关文章

发表评论取消回复