MAT022 Foundations of Statistics and Data Science Resit Coursework

Summative re-assessment for the module is by means of a single report on your statistical
analysis of a data set related to the National Basketball Association (NBA), a professional
basketball league in the USA. Please read this document carefully.

This form of assessment has been chosen because, as professional statisticians and data scientists,
you will often be asked to investigate a data set and report on whether it contains anything useful
or interesting. The assessment will also help you to prepare for writing your MSc dissertation
in the summer.

Your report will be assessed according to how well you are able to

• analyse the data set (40%),
• interpret the results of your analysis (30%), and
• present the results of your analysis and your interpretation of the data set (30%).

Your analysis should be performed using the R statistical software package, and your report
prepared using the R Markdown typesetting system and the template provided. Two marks will
be deducted for reports prepared using alternative systems such as Microsoft Word or LaTeX.

1 The data

The data set NBA_sample.csv is a partial record of shots taken by players in the NBA between
October 2014 and March 2015, and consists of 50,000 observations on 20 variables as described
in Table 1. A summary of the changes made to the data set provided for the original coursework
can be found at the end of this document.

Table 1

Variable             Description
GAME_ID              Unique id number of the game.
DATE                 Date of the game.
HOME_TEAM            Team playing at home.
AWAY_TEAM            Team playing away from home.
PLAYER_NAME          Name of the shooting player.
PLAYER_ID            Unique id number of the shooting player.
LOCATION             Whether the player was on the home (H) or away (A) team.
WIN_LOSE             Whether the player’s team won (W) or lost (L) the game.
SHOT_NUMBER          The number of the shot taken by the shooting player in that game.
PERIOD               The period of the game in which the shot was taken.
SEC_REMAIN           The number of seconds remaining in the period when the shot was taken.
SHOT_CLOCK           The time remaining before the shot must be taken.
DRIBBLES             Number of dribbles by the player before the shot was taken.
TOUCH_TIME           The time that the ball was in the shooting player’s hand.
SHOT_DIST            The distance of the shooting player from the basket.
PTS_TYPE             2 for shots from inside the arc, 3 for shots from outside the arc.
CLOSEST_DEFENDER     Name of the nearest defender when the shot was taken.
CLOSEST_DEFENDER_ID  Unique id number of the nearest defender.
CLOSE_DEF_DIST       Distance of the nearest defender when the shot was taken.
SUCCESS              Equal to 1 if the shot was made (scored), otherwise 0.
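
A reasonable first step is to confirm that the file you load matches the description above. The following is a minimal sketch in base R, not part of the official template: the helper name check_nba_sample is hypothetical, and the code assumes that NBA_sample.csv sits in the working directory and that its column headers are spelled exactly as in Table 1.

```r
# Variable names as listed in Table 1 (assumed to match the CSV header exactly).
expected_vars <- c(
  "GAME_ID", "DATE", "HOME_TEAM", "AWAY_TEAM", "PLAYER_NAME", "PLAYER_ID",
  "LOCATION", "WIN_LOSE", "SHOT_NUMBER", "PERIOD", "SEC_REMAIN", "SHOT_CLOCK",
  "DRIBBLES", "TOUCH_TIME", "SHOT_DIST", "PTS_TYPE", "CLOSEST_DEFENDER",
  "CLOSEST_DEFENDER_ID", "CLOSE_DEF_DIST", "SUCCESS"
)

# Hypothetical helper: read the file and compare its shape and column
# names with the description in Table 1.
check_nba_sample <- function(path, expected = expected_vars) {
  shots <- read.csv(path, stringsAsFactors = FALSE)
  list(
    data         = shots,
    n_obs        = nrow(shots),
    n_vars       = ncol(shots),
    missing_vars = setdiff(expected, names(shots)),  # described but absent
    extra_vars   = setdiff(names(shots), expected)   # present but not described
  )
}

# Only runs if the file is found in the working directory.
if (file.exists("NBA_sample.csv")) {
  result <- check_nba_sample("NBA_sample.csv")
  cat(result$n_obs, "observations on", result$n_vars, "variables\n")
  str(result$data)                    # storage type of each variable
  print(colSums(is.na(result$data)))  # missing values per variable
}
```

If the counts differ from the 50,000 observations on 20 variables stated above, or if missing_vars is not empty, it is worth checking the file and the read.csv arguments before starting any analysis.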