BUS255 Introduction to Data Analytics in R

Credits (ECTS):5

Course responsible:Ritvana Rrukaj

Campus / Online:Online

Teaching language:Engelsk

Course frequency:Annually

Nominal workload:125 hours

Teaching and exam period:This course starts in Spring parallel. This course has teaching/evaluation in Spring parallel, .

About this course

In this course you will learn to use R to solve common problems in data analysis. You will gain fundamental knowledge about data structures, analysis, and visualization.

Some of the key benefits of working in R are that all your work and analysis is fully reproducible, you can work with large datasets, continuous streams of data, utilize state-of-the-art modeling and data visualization techniques, and much more.

The course is divided into 4 parts:

  1. Data exploration (4 weeks)
  2. Data wrangling (4 weeks)
  3. Programming in R (3 weeks)
  4. Models in R (2 weeks)

The course will cover the following topics

  • Common data structures and data sources
  • Common file formats and data import
  • R and R Studio
  • Quarto
  • Data transformation
  • Data and model visualization using `ggplot`
  • Exploratory data analysis
  • Programming concepts (Functions, vectors and iterations)
  • Model building
  • A key learning outcome is effective written communication. You need to be able to communicate clearly about the choices you made before and during data gathering, cleaning and analysis; and you need to communicate the results using text, tables, and visualizations. You will learn to create reproducible reports and presentations using Quarto. If properly set up, you will see that all you have to when you get new data is to re-run your code to produce a new report with updated numbers and figures.

Learning outcome

Knowledge:

  • Understand the properties of raw data structures and their implications for the use of data analysis techniques
  • Be familiar with database structures and their implications for data management and data extraction
  • Know important techniques for data preparation, transformation, aggregation and exploration
  • Understand how pre-analysis choices, e.g., aggregating or dropping observations, affect analysis and interpretation of results
  • Understand what compromises may be necessary in the data analysis process from raw data to discussion and presentation of results and how these may affect or bias the results
  • Understand how programming can automate data analysis tasks, reduce errors and increase reproducibility of results.

Skills:

  • Have basic skills in R and R Studio
  • Be able to read in data from various sources and file formats, e.g., SQL databases, Excel, XML data streams
  • Be able to take "messy" raw data and prepare it for analysis
  • Be able to perform basic feature engineering tasks, e.g., variable selection, transformation, and aggregation
  • Be able to create informative tables and visualizations of data and analysis results
  • Create reproducible reports and presentations using Quarto

General competence:

  • Effectively communicate the results of data analysis using text, tables, and visualizations
  • Be able to build logical arguments and justify data- and analysis choices
  • Be able to ask technical questions in a way that others can come in and help with the solution.
  • Learning activities
    «Flipped classroom» with pre-recorded videos and resources available on Canvas. Weekly campus based and / or online seminars.
  • Teaching support

    A FAQ is available on Canvas. Online fora such as Stack Overflow can be used to learn from existing questions and answers, and to ask questions on a form so that others can help.

    Weekly automatically graded problem sets to help students revise the material is available on Canvas.

  • Prerequisites
    STAT100 or equivalent. Given the course structure, BUS255 can be taken at the same time as STAT100.
  • Assessment method
    Portfolio assessment comprising of 2 assignments.

    Portfolio Grading: Passed / Not Passed
  • Examiner scheme
    External examiner will control the quality of syllabus, questions for the final examination, and principles for the assessment of the examination answers.
  • Notes
    There is a master's version of this subject, BUS350, which is offered in the autumn.
  • Teaching hours
    • Lectures - prerecorded videos, approx. 18 hours
    • Supervised exercises - on campus or digital, 2 hours per week, approx. 26 hours.
  • Reduction of credits
    5 ECTS overlap with BUS350 and STIN300
  • Admission requirements
    Minimum requirements for entrance to higher education in Norway (generell studiekompetanse). This course is for bachelorstudents at NMBU. Students who can not register for the course, can contact studieveileder-hh@nmbu.no.