四步学Python( 免费书 & 资源)

Four steps to master machine learning with python (including free books & resources)

To understand and apply machine learning techniques you have to learn Python or R. Both are programming languages similar to C, Java or PHP. However, since Python and R are much younger and “farer away” from the CPU, they are easier. The advantage of Python is that it can be adopted to many other problems than R, which is only used for handling data, analysing it with e.g. machine learning and statistic algorythms and ploting it in nice graphs. Because Python has a broader distribution (hosting websites with Jango, natural language proecssing, accessing APIs of websites such as Twitter, Linkedin etc.) and resembles more classical programming languages like C Python is more popular.

The four steps of learning machine learning in python

First you have to learn the basics of Python using books, courses and videos.

Then you have to master the different moduls such as Pandas, Numpy, Matplotlib and Natural Language Processing (NLP) in order to handle, clean, plot and understand data.

Afterwards you have to able to scrap data from the web which is either done by using APIs of websites or the web-scraping moduls Beautiful Soup. Web scraping allows you to collect data which you feed into you machine learning algorithms.

In the last step you have to learn machine learning (ML) tools like Scikit-Learn or implement ML-algorithm from scratch.

1. Getting started with Python:

And easy and fast way to learn Python is to register atcodecademy.comand imediately start to code and learn the basics of python. A classic is the websitelearnpythonthehardwaywhich is referenced by a lot of python programmers. A good PDF is abyte of python. Alist of python resourcesfor beginners is also provided by the python community. A book from O’Reilley is Think Python, which can be downloaded forfree from here. A last resource isIntroduction to Python for Econometrics, Statistics and Data Analysiswhich also covers the basics of Python.

2. Important Modules for machine learning

The most important modules for machine learning areNumPy,Pandas,MatplotlibandIPython. A book covering a couple of these modules isData Analysis with Open Source Tools.  The free bookIntroduction to Python for Econometrics, Statistics and Data Analysisfrom 1. also covers Numpy, Pandas, matplotlib and IPython. Another resource isPython for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython, which also covers the most important modules. Her are other free Numpy (Numerical Python,Numpy Userguide,Guide to NumPy), Pandas (Pandas, Powerful Python Data Analysis Toolkit,Practical Business Python,Intros to Pandas Data Structure) andMatplotlib books.

Other resources:

10 minutes to Pandas

Pandas for machine learning

100 NumPy exercises

3. Mining and scraping the data from websites and through APIs

Once you have understood the basics of python and the most important modules you have to learn how to collect data from different sources. This technique is also called web scrapping. Classic sources are text from websites, textual data through APIs to access websites such as twitter or linkedin. Good books on web scraping areMining the Social Web (free book!),Web Scraping with PythonandWeb Scraping with Python: Collecting Data from the Modern Web.

Lastly this textual data has to be transformed into numerical data, which is done with natural language processing techniques covered byNatural language processing with PythonandNatural Language Annotation for Machine Learning. Other data are images and videos, which can be analysed using computer vision techniques:Programming Computer Vision with Python,Programming Computer Vision with Python: Tools and algorithms for analyzing imagesandPractical Python and OpenCVare typical resources to analyse images.

Educational and interesting examples of what you can already do using basic python commands and web scraping techniques can be found in these examples:

Mini-Tutorial: Saving Tweets to a Database with Python

Web Scraping Indeed for Key Data Science Job Skills

Case Study: Sentiment Analysis On Movie Reviews

First Web Scraper

Sentiment Analysis of Emails

Simple Text Classification

Basic Sentiment Analysis with Python

Twitter sentiment analysis using Python and NLTK

Second Try: Sentiment Analysis in Python

Natural Language Processing in a Kaggle Competition for Movie Reviews

4. Machine learning with Python

Machine learning can be divided into four groups. Classification, clustering, regression and dimensionalty reduction.

Classification can also be called supervised learning and helps one to classify an image in order to identify a symbol or face in the image, or to classify a user from its profile and to grant him different credit scores. Clustering happens under unsupervised learning and allows the user to identify groups/clusters within its data. Regression permits to estimate a value from a paramter set and can be used to predict the best price for a house, apartment or car.

All importantmodules, packages and techniquesto learn Machine Learning in Python, C, Scala, Java, Julia, MATLAB, Go, R and Ruby. Books about machine learning in python:

I especially recommend the bookMachine learning in action. Although a bit short it is probably a classic in machine learning due to its ageProgramming Collective Intelligence. These two books let you build machine learning algorithms from scratch.

Most recent publications about machine learning are base on the Python module scikit-learn. It makes machine learning very easy since all the algorithm are already implemented. The only thing you do is to tell python which ML-technique should be used to analyse the data.

A free scikit-learn tutorialcan be found on the official scikit-learn website. Other posts are be found here:

Introduction to Machine Learning with Python and Scikit-Learn

Data Science in Python

Machine Learning for Predicting Bad Loans

A Generic Architecture for Text Classification with Machine Learning

Using Python and AI to predict types of wine

Advice for applying Machine Learning

Predicting customer churn with scikit-learn

Mapping Your Music Collection

Data Science in Python

Case Study: Sentiment Analysis on Movie Reviews

Document Clustering with Python

Five most popular similarity measures implementation in python

Case Study: Sentiment Analysis on Movie Reviews

Will it Python?

Text Processing in Machine Learning

Hacking an epic NHL goal celebration with a hue light show and real-time machine learning

Vancouver Room Prices

Exploring and Predicting University Faculty Salaries

Predicting Airline Delays

Books about machine learning and the module scikit-learn in Python are:

Collection of books on reddit

Building Machine Learning Systems with Python

Building Machine Learning Systems with Python, 2nd Edition

Learning scikit-learn: Machine Learning in Python

Machine Learning Algorithmic Perspective

Data Science from Scratch – First Principles with Python

Machine Learning in Python

Books which are published in the coming months are:

Introduction to Machine Learning with Python

Thoughtful Machine Learning with Python: A Test-Driven Approach

Courses and blogs about Machine learning

You want to earn a degree, take an online course or attand a real workshop, camp or university course? Here are some links:Collection of linksto online education in analytics, Big Data, Data Mining, and Data Science. Coursera course inmachine learningandData Analyst Nanodegreefrom Udacity are other recommended online courses.List of frequently updatedblogs about machine learning.

A great youtube video is this class fromJake Vanderplas, Olivier Grisel about Exploring Machine Learning with Scikit-learn!

Theory of Machine Learning

Want to learn the theory of machine learning?The Elements of statistical LearningandIntroduction to Statistical Learningare often cited classics. Other books areIntroduction to machine learningandA Course in Machine Learning. The links contain free PDF, so you don’t have to pay them! Don’t want to read this? Watch15 hours theory of machine learning!

Please follow and like us:

最后编辑于
©著作权归作者所有,转载或内容合作请联系作者
  • 序言:七十年代末,一起剥皮案震惊了整个滨河市,随后出现的几起案子,更是在滨河造成了极大的恐慌,老刑警刘岩,带你破解...
    沈念sama阅读 217,542评论 6 504
  • 序言:滨河连续发生了三起死亡事件,死亡现场离奇诡异,居然都是意外死亡,警方通过查阅死者的电脑和手机,发现死者居然都...
    沈念sama阅读 92,822评论 3 394
  • 文/潘晓璐 我一进店门,熙熙楼的掌柜王于贵愁眉苦脸地迎上来,“玉大人,你说我怎么就摊上这事。” “怎么了?”我有些...
    开封第一讲书人阅读 163,912评论 0 354
  • 文/不坏的土叔 我叫张陵,是天一观的道长。 经常有香客问我,道长,这世上最难降的妖魔是什么? 我笑而不...
    开封第一讲书人阅读 58,449评论 1 293
  • 正文 为了忘掉前任,我火速办了婚礼,结果婚礼上,老公的妹妹穿的比我还像新娘。我一直安慰自己,他们只是感情好,可当我...
    茶点故事阅读 67,500评论 6 392
  • 文/花漫 我一把揭开白布。 她就那样静静地躺着,像睡着了一般。 火红的嫁衣衬着肌肤如雪。 梳的纹丝不乱的头发上,一...
    开封第一讲书人阅读 51,370评论 1 302
  • 那天,我揣着相机与录音,去河边找鬼。 笑死,一个胖子当着我的面吹牛,可吹牛的内容都是我干的。 我是一名探鬼主播,决...
    沈念sama阅读 40,193评论 3 418
  • 文/苍兰香墨 我猛地睁开眼,长吁一口气:“原来是场噩梦啊……” “哼!你这毒妇竟也来了?” 一声冷哼从身侧响起,我...
    开封第一讲书人阅读 39,074评论 0 276
  • 序言:老挝万荣一对情侣失踪,失踪者是张志新(化名)和其女友刘颖,没想到半个月后,有当地人在树林里发现了一具尸体,经...
    沈念sama阅读 45,505评论 1 314
  • 正文 独居荒郊野岭守林人离奇死亡,尸身上长有42处带血的脓包…… 初始之章·张勋 以下内容为张勋视角 年9月15日...
    茶点故事阅读 37,722评论 3 335
  • 正文 我和宋清朗相恋三年,在试婚纱的时候发现自己被绿了。 大学时的朋友给我发了我未婚夫和他白月光在一起吃饭的照片。...
    茶点故事阅读 39,841评论 1 348
  • 序言:一个原本活蹦乱跳的男人离奇死亡,死状恐怖,灵堂内的尸体忽然破棺而出,到底是诈尸还是另有隐情,我是刑警宁泽,带...
    沈念sama阅读 35,569评论 5 345
  • 正文 年R本政府宣布,位于F岛的核电站,受9级特大地震影响,放射性物质发生泄漏。R本人自食恶果不足惜,却给世界环境...
    茶点故事阅读 41,168评论 3 328
  • 文/蒙蒙 一、第九天 我趴在偏房一处隐蔽的房顶上张望。 院中可真热闹,春花似锦、人声如沸。这庄子的主人今日做“春日...
    开封第一讲书人阅读 31,783评论 0 22
  • 文/苍兰香墨 我抬头看了看天上的太阳。三九已至,却和暖如春,着一层夹袄步出监牢的瞬间,已是汗流浃背。 一阵脚步声响...
    开封第一讲书人阅读 32,918评论 1 269
  • 我被黑心中介骗来泰国打工, 没想到刚下飞机就差点儿被人妖公主榨干…… 1. 我叫王不留,地道东北人。 一个月前我还...
    沈念sama阅读 47,962评论 2 370
  • 正文 我出身青楼,却偏偏与公主长得像,于是被迫代替她去往敌国和亲。 传闻我的和亲对象是个残疾皇子,可洞房花烛夜当晚...
    茶点故事阅读 44,781评论 2 354

推荐阅读更多精彩内容

  • 追求,就是追和求和组合出击! 追的原因是自己和理想恋人有差距,需要奋起直追,从自我发展做起!不但,要完善自己的性格...
    世说新解阅读 141评论 0 0
  • 温水流过指尖 咖啡飘荡香气 夕阳晕染霞光 雨滴划过天际 秋风吹落枯叶 是你么 时间 稚童挥手告别 游子推开家门 新...
    说书客阅读 198评论 0 0