
Course catalog: Machine Learning – Data Science Training
Course outline:

    Machine Learning – Data Science Training

Machine Learning introduction
Types of Machine learning – supervised vs unsupervised learning
From Statistical learning to Machine learning
The Data Mining workflow:
Business understanding
Data Understanding
Data preparation
Modelling
Evaluation
Deployment
Machine learning algorithms
Choosing an appropriate algorithm for the problem
Overfitting and bias-variance tradeoff in ML
ML libraries and programming languages
Why use a programming language
Choosing between R and Python
Python crash course
Python resources
Python Libraries for Machine learning
Jupyter notebooks and interactive coding
Testing ML algorithms
Generalization and overfitting
Avoiding overfitting
Holdout method
Cross-Validation
Bootstrapping
Evaluating numerical predictions
Measures of accuracy: ME, MSE, RMSE, MAPE
Parameter and prediction stability
Evaluating classification algorithms
Accuracy and its problems
The confusion matrix
Unbalanced classes problem
Visualizing model performance
Profit curve
ROC curve
Lift curve
Model selection
Model tuning – grid search strategies
Examples in Python
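
As an illustration of the accuracy measures and the cross-validation split listed in this module, here is a minimal sketch using only the Python standard library. The function names `error_measures` and `k_fold_indices` are hypothetical, and the error is assumed to be defined as actual minus predicted (some texts use the opposite sign for ME).

```python
import math


def error_measures(actual, predicted):
    """Return ME, MSE, RMSE and MAPE for two equal-length sequences."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    # Assumption: error = actual - predicted
    errors = [a - p for a, p in zip(actual, predicted)]
    n = len(errors)
    me = sum(errors) / n
    mse = sum(e * e for e in errors) / n
    rmse = math.sqrt(mse)
    # MAPE is undefined when an actual value is zero (division by zero)
    mape = 100.0 * sum(abs(e / a) for e, a in zip(errors, actual)) / n
    return {"ME": me, "MSE": mse, "RMSE": rmse, "MAPE": mape}


def k_fold_indices(n_samples, k):
    """Yield (train_indices, test_indices) for k contiguous, unshuffled folds."""
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        yield train, test
        start += size
```

In practice the data would usually be shuffled before splitting; the contiguous folds here are kept only to make the mechanics easy to follow.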
Data preparation
Data import and storage
Understand the data – basic explorations
Data manipulations with pandas library
Data transformations – Data wrangling
Exploratory analysis
Missing observations – detection and solutions
Outliers – detection and strategies
Standardization, normalization, binarization
Qualitative data recoding
Examples in Python
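
The rescaling and recoding steps listed in this module could look roughly like the sketch below. It uses plain Python lists rather than pandas to stay self-contained; the function names are hypothetical, standardization is assumed to use the population standard deviation, and qualitative recoding is shown as one-hot encoding, which is only one of several possible schemes.

```python
import statistics


def standardize(values):
    """Rescale to zero mean and unit (population) standard deviation."""
    mean = statistics.fmean(values)
    sd = statistics.pstdev(values)
    if sd == 0:
        raise ValueError("cannot standardize a constant column")
    return [(v - mean) / sd for v in values]


def min_max_normalize(values):
    """Rescale linearly so the minimum maps to 0 and the maximum to 1."""
    lo, hi = min(values), max(values)
    if lo == hi:
        raise ValueError("cannot normalize a constant column")
    return [(v - lo) / (hi - lo) for v in values]


def binarize(values, threshold):
    """Map each value to 1 if it is strictly above the threshold, else 0."""
    return [1 if v > threshold else 0 for v in values]


def one_hot(labels):
    """Recode a qualitative column as indicator columns, one per category."""
    categories = sorted(set(labels))
    rows = [[1 if label == c else 0 for c in categories] for label in labels]
    return categories, rows
```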
Classification
Binary vs multiclass classification
Classification via mathematical functions
Linear discriminant functions
Quadratic discriminant functions
Logistic regression and probability approach
k-nearest neighbors
Naïve Bayes
Decision trees
CART
Bagging
Random Forests
Boosting
XGBoost
Support Vector Machines and kernels
Maximal Margin Classifier
Support Vector Machine
Ensemble learning
Examples in Python
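
Of the classifiers listed in this module, k-nearest neighbors is the easiest to sketch from scratch. The version below is a minimal illustration that assumes Euclidean distance and a simple majority vote; `knn_predict` is a hypothetical name, and a real project would more likely rely on an established library.

```python
import math
from collections import Counter


def knn_predict(train_points, train_labels, query, k=3):
    """Predict the label of `query` by majority vote among its k nearest points."""
    # Pair every training point's Euclidean distance to the query with its label
    distances = sorted(
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    )
    # Count labels among the k closest points and return the most frequent one
    votes = Counter(label for _, label in distances[:k])
    return votes.most_common(1)[0][0]
```

Sorting every distance is fine for small data sets; larger ones typically need a spatial index or an approximate search.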
Regression and numerical prediction
Least squares estimation
Variable selection techniques
Regularization and stability – L1, L2
Nonlinearities and generalized least squares
Polynomial regression
Regression splines
Regression trees
Examples in Python
Unsupervised learning
Clustering
Centroid-based clustering – k-means, k-medoids, PAM, CLARA
Hierarchical clustering – DIANA, AGNES
Model-based clustering – EM
Self-organising maps
Cluster evaluation and assessment
Dimensionality reduction
Principal component analysis and factor analysis
Singular value decomposition
Multidimensional Scaling
Examples in Python
Text mining
Preprocessing data
The bag-of-words model
Stemming and lemmatization
Analyzing word frequencies
Sentiment analysis
Creating word clouds
Examples in Python
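
The bag-of-words model and word-frequency analysis from this module can be sketched with the standard library alone. The tokenizer below is a deliberately crude assumption (lowercase, then runs of letters, digits and apostrophes), and the names `tokenize` and `bag_of_words` are hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def bag_of_words(documents):
    """Return a sorted vocabulary and one row of word counts per document."""
    counts = [Counter(tokenize(doc)) for doc in documents]
    vocabulary = sorted(set().union(*counts))
    matrix = [[c[word] for word in vocabulary] for c in counts]
    return vocabulary, matrix
```

Each row ignores word order entirely, which is the defining simplification of the model.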
Recommendation engines and collaborative filtering
Recommendation data
User-based collaborative filtering
Item-based collaborative filtering
Examples in Python
Association pattern mining
Frequent itemsets algorithm
Market basket analysis
Examples in Python
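
To make the idea of frequent itemsets concrete, the sketch below counts support by brute-force enumeration of candidate itemsets up to a given size. This is not the pruned Apriori search that a course on the topic would normally cover, only a small illustration of what "support" means in market basket analysis; `frequent_itemsets` and its parameters are hypothetical.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_support, max_size=2):
    """Return {itemset: support} for itemsets whose support >= min_support."""
    n = len(transactions)
    baskets = [set(t) for t in transactions]
    items = sorted(set().union(*baskets))
    result = {}
    for size in range(1, max_size + 1):
        # Brute force: try every combination of items of this size
        for candidate in combinations(items, size):
            count = sum(1 for basket in baskets if basket.issuperset(candidate))
            support = count / n
            if support >= min_support:
                result[candidate] = support
    return result
```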
Outlier Analysis
Extreme value analysis
Distance-based outlier detection
Density-based methods
High-dimensional outlier detection
Examples in Python
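
A very simple form of extreme value analysis flags observations whose z-score exceeds a chosen cutoff. The sketch below assumes the population standard deviation and a user-supplied threshold; `z_score_outliers` is a hypothetical name, and the default of 3.0 is a common convention rather than a rule.

```python
import statistics


def z_score_outliers(values, threshold=3.0):
    """Return the values lying more than `threshold` standard deviations from the mean."""
    mean = statistics.fmean(values)
    sd = statistics.pstdev(values)
    if sd == 0:
        return []  # a constant series has no outliers under this definition
    return [v for v in values if abs(v - mean) / sd > threshold]
```

Because the outlier itself inflates the mean and the standard deviation, this approach can miss extreme points in small samples, which is one motivation for the distance-based and density-based methods listed above.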
Machine Learning case study
Business problem understanding
Data preprocessing
Algorithm selection and tuning
Evaluation of findings
Deployment
