python数据类型

基本数据类型（表示一个数字）

整型： 5，-7，23
浮点型： 1.23， 2.5e4
复数型： 3.6+1.5j
布尔型：True， False

序列类型：

字符串：用单引号，双引号，三引号括起来的都是。其中三引号对于长文本特别好用。
列表：方括号表示[]，中间用逗号隔开
元组：与列表类似，但用括号（）表示，元组值不可变，列表可变。

映射类型：

字典：用大括号表示，用逗号隔开每一项，每一项是键值对（key：value）

函数：一段可复用的代码，一般具有特定功能

分为内置函数和自定义函数

# 变量类型
type(1)
# 类型转换
a = int(3.77)
b = float("4.5")
c = str(55)
print(a,b,c)
type(c)

# 元素个数
len([1,2,3])
len("o2dd")

# range函数生成整数序列
list1 = range(0,9)
type(list1)

# 强制转换为list类型
list(list1)

# 找最值
max(5,8,2,5)
min([3,9,1,8])

#求近似值
round(2.7)

布尔表达式
x==y 比较x和y是否相等
x!=y 比较x和y是否不等
x<y
x>y
x is y x和y是否相同类型必须一致
x is not y x和y是否不同

a = 1
b = 1
c = 1.0
d = '1.0'
print(a==c,a is b, a is c)
True True False

逻辑运算符 and or not 与或非
and 同真为真
or 同假为假
not 取反

python 使用缩进来表示层级

条件语句
if else

循环语句
for variable in sequence
可以是列表,元组，range，字符串，
for x in "family":
print(x)
for x in range(0,5):
print(x)
for x in [5,6,7,9]:
print(x)
for x in (10,11,12):
print(x)

可以使用enumerate函数同时获取索引和值
numbers = [21,34,45,32,43,8]
for index,value in enumerate(numbers):
print(index,value)
0 21
1 34
2 45
3 32
4 43
5 8

词频
文本中某个词出现的次数成为词频
词频计算：

1. 使用split（）函数拆分单词
2. 获取不重复的单词
unique_words = list()
for word in words:
    if word not in unique_words:
        unique_words.append(word)
3. 创建数组记录出现的次数
counts = [0] * len(unique_words)
for word in words:
    index = unique_words.index(word)
    counts[index] = counts[index] + 1
4.  计算次数数组中的最大值，与不重复的单词数组下标是对应的。
unique_words[counts.index(7)]

python数据类型

基本数据类型（表示一个数字）

序列类型：

映射类型：

函数：一段可复用的代码，一般具有特定功能

推荐阅读更多精彩内容