Introduction to Machine Learning in R (2-days workshop)

A workshop for people who are beginning their association with artificial intelligence (AI). You will be introduced to modern machine learning techniques from the ground up. Machine learning is currently used in many companies from different sectors of the economy, and the number of applications and demand for services related to it is still growing. This innovative area has applications in many fields: business, science, marketing, computer science. Every day we can observe many examples of how machine learning works: autocorrect while typing, spam filtering, gene sequence analysis, sales prediction, market research, clustering of shoppers, virtual assistants on websites. The training will be continued at the course on deep learning in R and is a live workshop with a progress check for participants and an opportunity to ask questions. Participants will receive an electronic certificate of completion after successfully passing the workshop.

Date: to be agreed with the client (2-days online workshop).

Time: 10 hours (5 hours each day with a coffee break).

Location: online.

Group: for a group larger than 3 persons.

Teacher: Tomasz Górecki, Ph.D.

Price: 445 EUR (plus VAT if applicable) per person.

Topics:

  • Introduction to tidyverse library – data preparation
  • Introduction to ggplot2 library – data visualization
  • Data Imputation
  • Detection of outliers
  • Coding of qualitative data
  • Linear and multiple regression
  • Collinearity of variables, nonlinear and logistic regression
  • Data dimension reduction: PCA and kernel PCA, t-SNE and UMAP methods, multidimensional scaling, correspondence analysis
  • Cluster analysis: hierarchical and non-hierarchical methods (k-means, PAM), DBSCAN method
  • Classification: nearest neighbor method, SVM, neural networks, decision trees, random forests

Learning Outcomes

After completing the course, the participant will:

  1. Be able to work with the R programming language, enabling independent data analysis and development in the field of machine learning.
  2. Be familiar with the tidyverse library, which allows for:
    1. Efficient data preparation for analysis,
    2. Data loading, transformation, and cleaning.
  3. Be able to use the ggplot2 library to create advanced data visualizations, enhancing understanding and presentation of results.
  4. Be capable of handling missing data using imputation techniques and detecting and interpreting outliers (anomalies).
  5. Know methods for encoding categorical data, enabling their use in advanced analyses and machine learning models.
  6. Be able to perform dependency analyses, such as:
    1. Linear regression,
    2. Multiple regression,
    3. Non-linear regression,
    4. Logistic regression,
  7. Be able to take into account problems of collinearity of variables.
  8. Understand and apply advanced dimensionality reduction techniques, including:
    1. PCA (Principal Component Analysis),
    2. Kernel PCA,
    3. t-SNE,
    4. UMAP,
    5. Multidimensional Scaling,
    6. Correspondence Analysis.
  9. Be capable of conducting clustering analyses, including hierarchical and non-hierarchical methods, such as:
    1. K-means clustering,
    2. PAM (Partitioning Around Medoids),
    3. DBSCAN.
  10. Know and be able to apply data classification techniques, such as:
    1. k-Nearest Neighbors (k-NN),
    2. Support Vector Machines (SVM),
    3. Neural networks,
    4. Decision trees,
    5. Random forests.
  11. Be able to build and evaluate predictive models and apply them to practical scenarios.
  12. Possess the ability to independently:
    1. Conduct comprehensive data analyses,
    2. Create advanced visualizations,
    3. Build and deploy machine learning models to support business and scientific decision-making.

Tomasz Górecki, Ph.D., D.Sc., Associate Professor

Professor at the Department of Mathematical Statistics and Data Analysis at the Faculty of Mathematics and Computer Science of Adam Mickiewicz University in Poznań, Vice-Dean for grants and cooperation with business at the Faculty of Mathematics and Computer Science of Adam Mickiewicz University in Poznań. His scientific interests include time series analysis, functional data analysis and artificial intelligence. He also deals with applications of statistical methods and machine learning in practical problems. He has been teaching probability calculus, statistics, machine learning, and artificial intelligence for over twenty years. He cooperates with many companies in the field of machine learning and artificial intelligence (Lidl, OLX, PWN, Samsung, Smartstock). He has been using R language for many years (he is a co-author of two R packages and an author of a book on R language). He cooperates with employees of Colorado State University, University of Utah, Poznan University of Technology, Poznan University of Economics and Poznan University of Life Sciences.

 


 

Registration End Date

30-09-2025
 

Share Event

Zapisz się do naszego Newslettera!
Zapisz się do naszego newslettera i otrzymaj 10% zniżki przy pierwszym zamówienia na jeden z naszych dwudniowych warsztatów, bieżące informacje dotyczące naszych produktów i usług oraz nowości ze świata bioinformatyki!
icon