Python 笔记 #13# Pandas: Viewing Data

感觉很详细：数据分析：pandas 基础

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt

dates = pd.date_range(‘20180116‘, periods=3) # 创建 16 17 18 等六个日期

df = pd.DataFrame(np.random.randn(3,4), index=dates, columns=list(‘ABCD‘)) # 这是二维的，类似于一个表！
# 通过 numpy 随机了一个 3 * 4 的数据，这和行数、列数是相对应的
# print(df)
#                    A         B         C         D
# 2018-01-16 -0.139759  0.857653  0.754470  0.224313
# 2018-01-17  1.565070  0.521973 -1.265168 -0.278524
# 2018-01-18 -0.668574 -0.527155  0.877785 -1.123334

# print(df.head(1)) # 默认值是 5
#                    A         B         C         D
# 2018-01-16 -0.039203  1.211976  0.664805  0.307147

df.tail(5) # 同上，顾名思义

# print(df.index) # 顾名思义 + 1
# print(df.columns)
# DatetimeIndex([‘2018-01-16‘, ‘2018-01-17‘, ‘2018-01-18‘], dtype=‘datetime64[ns]‘, freq=‘D‘)
# Index([‘A‘, ‘B‘, ‘C‘, ‘D‘], dtype=‘object‘)

# print(df.describe()) # 对每列数据做一些简单的统计学处理
#               A         B         C         D
# count  3.000000  3.000000  3.000000  3.000000
# mean  -0.163883 -0.107242 -0.621706  0.618341
# std    0.360742  0.429078  0.800366  0.609524
# min   -0.505212 -0.502887 -1.352274  0.055032
# 25%   -0.352602 -0.335291 -1.049444  0.294803
# 50%   -0.199991 -0.167695 -0.746613  0.534574
# 75%    0.006782  0.090581 -0.256421  0.899995
# max    0.213556  0.348857  0.233770  1.265416

# print(df.T) # 转置（Transposing）
#    2018-01-16  2018-01-17  2018-01-18
# A   -1.137015   -0.067200    0.737709
# B   -1.141811    0.335953    1.023016
# C    2.481266   -0.957599    0.011144
# D    1.485434   -0.605588    0.592746

# print(df)
# print(df.sort_index(axis=1, ascending=False)) # axis=1 按照列名排序 axis=0 按照行名排序
#                    A         B         C         D
# 2018-01-16 -0.787226  0.321619  1.097938 -0.701082
# 2018-01-17 -0.417257 -0.163390 -0.943166 -0.497475
# 2018-01-18  0.486670 -0.733582  1.923475 -1.145891
#                    D         C         B         A
# 2018-01-16 -0.701082  1.097938  0.321619 -0.787226
# 2018-01-17 -0.497475 -0.943166 -0.163390 -0.417257
# 2018-01-18 -1.145891  1.923475 -0.733582  0.486670

# print(df.sort_values(by=‘B‘))
#                    A         B         C         D
# 2018-01-17  0.817088 -0.792903  1.643429 -0.008784
# 2018-01-18  0.540910  0.662119  0.190846 -0.960926
# 2018-01-16  0.333727  1.196133 -0.527796  0.677337

原文地址：https://www.cnblogs.com/xkxf/p/8306588.html

时间： 2024-11-13 09:47:08

Python 笔记 #13# Pandas: Viewing Data的相关文章

MIT麻省理工学院公开课：计算机科学及编程导论 Python 笔记1-3

Lecture1:Goals of the course; what is computation; introduction to data types, operators, and variables Python High (√) VS. low General (√) VS. targetted Interpreted (√) VS. compile Syntax语法:what are legal expressions "cat dog boy " Static seman

Python 笔记 #14# Pandas: Selection

import pandas as pd import numpy as np import matplotlib.pyplot as plt dates = pd.date_range('20180116', periods=3) # 创建 16 17 18 等六个日期 df = pd.DataFrame(np.random.randn(3,4), index=dates, columns=list('ABCD')) # 这是二维的,类似于一个 # Getting # print(df['A']

Python 笔记 #18# Pandas: Grouping

10 Minutes to pandas 引 By “group by” we are referring to a process involving one or more of the following steps Splitting the data into groups based on some criteria Applying a function to each group independently Combining the results into a data st

Python 笔记 #17# Pandas: Merge

10 Minutes to pandas Concat df = pd.DataFrame(np.random.randn(10, 4)) print(df) # break it into pieces pieces = [df[:3], df[3:7], df[7:]] print(pd.concat(pieces)) # 0 1 2 3 # 0 0.879526 -1.417311 -1.309299 0.287933 # 1 -1.194092 1.237536 -0.375177 -0

20180426学习python笔记（pandas使用）

原文地址:https://www.cnblogs.com/beijingjiaotongdaxue/p/8955138.html

python 学习笔记 13 -- 常用的时间模块之time

Python 没有包含对应日期和时间的内置类型,不过提供了3个相应的模块,可以采用多种表示管理日期和时间值: * time 模块由底层C库提供与时间相关的函数.它包含一些函数用于获取时钟时间和处理器的运行时间,还提供了基本解析和字符串格式化工具 * datetime 模块为日期.时间以及日期时间值提供一个更高层接口.datetime 中的类支持算术.比较和时区配置. * calendar 模块可以创建周.月和年的格式化表示.它还可以用来计算重复事件.给定日期是星期几,以及其他基

python基础教程_学习笔记13：标准库：一些最爱——sys

标准库:一些最爱 sys sys这个模块让你能够访问与python解释器联系紧密的变量和函数. sys模块中一些重要的函数和变量函数/变量描述 argv 命令行参数,包括脚本名称 exit([arg]) 退出当前程序,可选参数为给定的返回值或者错误信息 modules 映射模块名字到载入模块的字典 path 查找模块所在目录的目录名列表 platform 类似sunos5或者win32的平台标识符 stdin 标准输入流--一个类文件对象 stdout 标准输出流--一个类文件对象 stde

Python pandas.io.data 模块迁移

这段时间用pandas做数据分析, import pandas.io.data as web 然后得到下面的错误提示 "The pandas.io.data module is moved to a separate package " ImportError: The pandas.io.data module is moved to a separate package (pandas-datareader). After installing the pandas-datarea

玩蛇(Python)笔记之基础Part3

玩蛇(Python)笔记之基础Part1 一.集合 1.set 无序,不重复序列 {}创建,直接写元素 2.set功能 __init__()构造方法,,使用强制转换就会调用此方法 1 set1 = {'year', 'jiujiujiu'} 2 print(type(set1)) 3 # 创建集合 4 s = set() # 创建空集合 5 li = [11, 22, 11, 22] 6 s = set(li) set 3.集合的基本操作 1 # 操作集合 2 s1 = set() 3 s1.a