1. Reading a table file into R as a data frame
read.csv(): usually used to read CSV files
read.table(): usually used to read txt files
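The difference between the two functions is mostly their defaults. The sketch below writes a small hypothetical file (demo_read.csv, in a temporary directory; the name and contents are made up for illustration) and reads it both ways. It is an illustration under those assumptions, not a recipe for any particular dataset.

```r
# Hypothetical demo file written to a temporary directory.
csv_path <- file.path(tempdir(), "demo_read.csv")
writeLines(c("gene,sam,exp",
             "gene1,sample1,32",
             "gene2,sample2,34"), csv_path)

# read.csv() assumes a header row and comma separators by default.
df_csv <- read.csv(csv_path)

# read.table() assumes whitespace separators and, for a file like this one,
# no header row, so both are set explicitly to read the same file.
df_tab <- read.table(csv_path, header = TRUE, sep = ",")

str(df_csv)                 # structure of the resulting data frame
all.equal(df_csv, df_tab)   # compare the two results
```

Both calls return a data frame, so everything that follows (head(), ncol(), colnames(), and so on) works the same way on either result.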
2. Exporting a data frame to a table file
- CSV format: write.csv(x, file = "foo.csv")
  - x can be a matrix or a data frame
  - "foo.csv" is the file name
- txt format: write.table()
  - write.table(test, file = "example.txt")
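A minimal sketch of both export calls follows. The data frame test and the output paths are made up for illustration, and the extra arguments (row.names, sep, quote) are optional choices rather than anything the notes above require.

```r
# Hypothetical data frame standing in for `test` / `x` in the notes.
test <- data.frame(gene = c("gene1", "gene2"), exp = c(32, 34))

csv_out <- file.path(tempdir(), "foo.csv")
txt_out <- file.path(tempdir(), "example.txt")

# row.names = FALSE keeps the automatic row numbers out of the file.
write.csv(test, file = csv_out, row.names = FALSE)

# write.table() separates fields with a space by default;
# sep = "\t" produces a tab-delimited file and quote = FALSE leaves strings unquoted.
write.table(test, file = txt_out, sep = "\t", quote = FALSE, row.names = FALSE)

readLines(csv_out)   # inspect the raw text that was written
readLines(txt_out)
```

Without row.names = FALSE, the row names are written as an extra first column, which then shows up as an additional column when the file is read back with read.csv().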
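3. R's own data format: Rdata
- A storage format specific to R; it cannot be opened with other software
- It stores variables (R objects), not table files
- save() saves, load() loads
- save(test, file = "example.Rdata")
- load("example.Rdata")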
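The save() and load() round trip can be sketched as follows. The data frame test and the temporary path are hypothetical stand-ins; the point is that load() restores the variable under its original name instead of returning the data.

```r
# Hypothetical variable to store.
test <- data.frame(gene = c("gene1", "gene2"), exp = c(32, 34))
rdata_path <- file.path(tempdir(), "example.Rdata")

save(test, file = rdata_path)

rm(test)                     # remove the variable to show that load() brings it back
loaded <- load(rdata_path)   # load() returns the names of the restored objects

loaded                       # name(s) of the objects that were restored
exists("test")               # check that `test` is available again
```

Because load() recreates variables by name, it can silently overwrite an existing variable with the same name, so it helps to know what an .Rdata file contains before loading it.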
How an Rproject is organized:
- Scripts (.R)
- Data (.Rdata)
- Tables (.txt or .csv)
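4. Common error: file not found
[Screenshot: 找不到文件.png]
Solutions:
- Cause 1: the file is not stored in the working directory (use an Rproject)
- Cause 2: the file name is misspelled (use Tab completion)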
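Both causes can be checked from the console before calling a read function. The sketch below builds a hypothetical folder (demo_project, in a temporary directory) so that it runs anywhere; in a real session the same calls would be pointed at the working directory.

```r
# Hypothetical project folder containing one file.
demo_dir <- file.path(tempdir(), "demo_project")
dir.create(demo_dir, showWarnings = FALSE)
good <- file.path(demo_dir, "complete_set.txt")
bad  <- file.path(demo_dir, "complet_set.txt")   # deliberately misspelled name
writeLines("geneA geneB geneC", good)

getwd()                # the directory against which relative paths are resolved

file.exists(good)      # does the correctly spelled file exist?
file.exists(bad)       # does the misspelled one?

# List the .txt files that are actually there, to compare against what was typed.
list.files(demo_dir, pattern = "\\.txt$")
```

Opening the project through its .Rproj file sets the working directory to the project folder, which is why relative file names then resolve without further setup.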
Exercises
1. Read complete_set.txt (already saved in the working directory)
> x=read.table("complete_set.txt")
> head(x)
V1 V2 V3
1 geneA geneB geneC
2 -0.635020187971398 -0.49728008811353 0.514896730700242
3 0.91605661780324 -0.545381308500589 1.20238322656491
4 0.805995294157758 -0.315914513323816 0.27825197143441
5 0.538008107104279 -0.0673921064676487 -0.623764802577221
6 1.03110520787851 0.868414824317706 -1.61616345430907
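In the output above, the names geneA, geneB and geneC appear as the first data row while the columns are called V1, V2, V3, which suggests that the file starts with a header line that was read as data. A possible fix, shown here on a small made-up file (demo_header.txt with invented values, not the real complete_set.txt), is to pass header = TRUE:

```r
# Hypothetical three-column file with a header line.
txt_path <- file.path(tempdir(), "demo_header.txt")
writeLines(c("geneA geneB geneC",
             "-0.63 -0.49 0.51",
             "0.91 -0.54 1.20"), txt_path)

no_header   <- read.table(txt_path)                  # header line is kept as a data row
with_header <- read.table(txt_path, header = TRUE)   # header line becomes the column names

colnames(no_header)
colnames(with_header)
sapply(with_header, class)   # column types once the header text is out of the data
```

When the header is read as data, every column contains text in its first row, so the numeric values end up stored as text as well. Reading with header = TRUE avoids that.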
2. Check the number of rows and columns
> ncol(x)
[1] 20
> nrow(x)
[1] 51
3. Get the row names and column names
> rownames(x)
[1] "1" "2" "3" "4" "5" "6" "7" "8" "9" "10" "11" "12" "13"
[14] "14" "15" "16" "17" "18" "19" "20" "21" "22" "23" "24" "25" "26"
[27] "27" "28" "29" "30" "31" "32" "33" "34" "35" "36" "37" "38" "39"
[40] "40" "41" "42" "43" "44" "45" "46" "47" "48" "49" "50" "51"
> colnames(x)
[1] "V1" "V2" "V3" "V4" "V5" "V6" "V7" "V8" "V9" "V10" "V11"
[12] "V12" "V13" "V14" "V15" "V16" "V17" "V18" "V19" "V20"
4. Export as CSV
> y=read.csv("gene.csv")
> y
gene sam exp
1 gene1 sample1 32
2 gene2 sample2 34
3 gene3 sample3 45
> write.csv(y, file = "lianxi4.csv")
5. Save as Rdata
> save(y,file = "lianxi5.Rdata")
6. Load class.Rdata and check the data type of each column
> load("G:/生信技能树-长沙/2、三天生信实际讲课/1、第一天上课/Day1-R/01-get_start/lianxi5.Rdata")
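The transcript above stops after load(), so the type check itself is not shown. One way to do it, sketched on a made-up data frame y that mirrors gene.csv from exercise 4 (the contents of class.Rdata are not given here, so this is only a stand-in), is sapply() with class(), or str():

```r
# Hypothetical stand-in for the object stored in an .Rdata file.
y <- data.frame(gene = c("gene1", "gene2", "gene3"),
                sam  = c("sample1", "sample2", "sample3"),
                exp  = c(32L, 34L, 45L))
rdata_path <- file.path(tempdir(), "demo_types.Rdata")
save(y, file = rdata_path)
rm(y)

load(rdata_path)                  # restores `y`

col_classes <- sapply(y, class)   # one class per column, as a named vector
col_classes
str(y)                            # compact overview: dimensions, types, first values
```

sapply(y, class) is convenient when the result is needed as a value, for example to pick out all numeric columns, while str() is quicker for a visual check.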