2020-2021 Group Coursework
MAT013 Group Coursework
Deadline: May 21, 2021
Instructions
The outputs of this coursework will be:
- A 20 minute group presentation/demonstration recorded using zoom (or another video recording software).
- All relevant files (code, presentation, notes, websites, demo materials, etc) should be passed to Andrey Pepelyshev on or before May 21, 2021.
Marking criteria:
- Difficulty: [30]
- Accuracy: [30]
- Originality: [20]
- Presentation/demonstration: [20]
Coursework
The recommended size of the group is 2-4 students.
As a group you are required to present how to solve a particular scientific problem using R. You should use aspects of R that are not given in the notes. The presentation should be viewed as a teaching presentation. You should proceed as follows:
- Choose a scientific problem. Some examples of statistical problems are:
- Logistic/Binary/Poisson regression
- Classification/Clustering
- Discrimination analysis
- Multidimensional scaling
- Pattern recognition
- Time series analysis and forecasting
- Choose a package for R. Some examples are gbm, xgboost, LiblineaR, cluster, MASS, smacof, superMDS, neuralnet, deepnet, Rssa.
- Find a dataset for demonstrating how to use the chosen package for solving the chosen problem. Usually, each package contains references to few suitable datasets.
- Your group should write the choice of a problem, a package and a dataset in "Discussion" at Learning Central in order to avoid the same choice by other groups. On selection of a topic, it is advisable to ask Andrey Pepelyshev whether or not it is suitable.
- In your group coursework, (i) explain a problem, (ii) explain certain technical aspects of a package, (iii) explain solution of a particular problem for a dataset.
You are not constrained by the use of slides (although you are welcome to). Feel free to be imaginative.