# 数据分析代写 | MAT022 Foundations of Statistics and Data Science Resit Coursework

Summative re-assessment for the module is by means of a single report on your statistical
analysis of a data set related to the National Basketball Association (NBA), a professional

This form of assessment has been chosen because as professional statisticians and data scientists,
you will often be asked to investigate a data set and report on whether it contains anything useful
in the summer.

Your report will be assessed according to how well you are able to

 analyse the data set, 40%
 interpret the results of your analysis, and 30%
 present the results of your analysis and your interpretation of the data set. 30%

Your analysis should be performed using the R statistical software package, and your report
prepared using the R Markdown typesetting system and the template provided. Two marks will
be deducted for reports prepared using alternative systems such as Microsoft Word or LaTeX.

1 The data

The data set NBA_sample.csv is a partial record of shots taken by players in the NBA between
October 2014 and March 2015, and consists of 50,000 observations on 20 variables as described
in Table 1. A summary of the changes made to the data set provided for the original coursework
can be found at the end of this document.

Variable                                        Description
GAME_ID                                    Unique id number of the game.
DATE                                             Date of the game.
HOME_TEAM                             Team playing at home.
AWAY_TEAM                              Team playing away from home.
PLAYER_NAME                          Name of the shooting player.
PLAYER_ID                                  Unique id number of the shooting player.
LOCATION                                    Whether the player was on the home (H) or away (A) team.
WIN_LOSE                                   Whether the player’s team won (W) or lost (L) the game.
SHOT_NUMBER                         The number of the shot taken by the shooting player in that game.
PERIOD                                          The period of the game that the shot was taken.
SEC_REMAIN                              The number of seconds before the end of the period that the shot was taken.
SHOT_CLOCK                              The time remaining before the shot must be taken.
DRIBBLES                                     Number of dribbles by the player before the shot was taken.
TOUCH_TIME                              The time that the ball was in the shooting player’s hand.
SHOT_DIST                                  The distance of the shooting player from the basket.
PTS_TYPE                                      2 for shots from inside the arc, 3 for shots from outside the arc.
CLOSEST_DEFENDER              Name of the nearest defender when the shot was taken.
CLOSEST_DEFENDER_ID      Unique id number of the nearest defender.
CLOSE_DEF_DIST                     Distance of the nearest defender when the shot was taken.
SUCCESS                                        Equal to 1 if the shot was made (scored), otherwise 0.

E-mail: itcsdx@outlook.com  微信:itcsdx