数据可视化代写|7CCSMSDV Introduction to Data Visualization Summer 2023 (Period 2)

这是一篇来自英国的关于数据可视化技术的代写

 

Introduction

This assignment is divided into 3 parts: Analytics, Design and Prototyping,Implementation. Please read carefully the instructions related to each part.

Data Scientists are called to put their skills to action to help mine the large and diverse amount of data available to support knowledge discovery in front of diverse .

To this aim the theme of this assignment will be focusing on analysis of three main topics: cost of living, social media, sport events.

The assignment has therefore three main themes (or tracks). You are required to choose ONE.

Proposed Tracks:

  1. Cost of Living data analysis;
  2. Social Media data analysis;
  3. Sport data analysis.

Data. As a starting point we are providing you with:

  • Three starting datasets reporting on: (i) Cost of living increase in London

(ii) Social media influence on shopping (iii) Formula 1 World Championship from 1950 to 2023.

At least one of the three provided datasets should be used for the assignment. [Datasets available on module Keats page]

  • A series of links to relevant data repositories and projects. These are given as a starting point, it is not mandatory to use them and you are invited to look for any other datasets available on the web that you may find interesting. [List provided on module Keats page]

Visualizations. We are providing you with links to existing visualizations, again these are given as a starting point and for inspiration, it is not mandatory to follow them and you are invited to look for other visualizations that are available on the web and that you may find inspirational. [List provided on module Keats page]

Research questions. You are provided with an initial research question for each track:

Q1 – Track 1: “Analyze the development of cost-of-living over time. Are there any detectable trends?”

Q1 – Track 2: “Analyze the effects of social media influence on shopping. Are there any detectable trends?”

Q1 – Track 3: “Analyze performances of Formula 1 teams. Are there any detectable trends?”

Part 1. Analytics [20 marks total]

Based on the track you chose and the respective data provided in the Data section and/or other datasets you may have encountered or may find online,you are required to:

a.Propose two or more exploratory research questions (non-trivial questions) beyond Q1, label them as Q2, Q3, etc. [Note: You can propose questions that might require access to datasets beyond the one provided, these can be questions complementary to Q1, investigating other aspects than the ones investigated by Q1, or refinements/expansions of Q1, you are free to decide.]. [10 marks]

b.Explain what type of data could be used to answer Q1 and each one of the questions you proposed in a.. Assess the appropriateness of each dataset(s) (the ones we provided and other resources you may have found, please provide links to the latter), you would potentially be using to answer the questions. [5 marks]

c.Explain if and how datasets, you described in b., are or could be correlated. [5 marks]

Part 2. Design and Discussion [20 marks total]

Based on visualization approaches surveyed in class and in recommended readings:

a.Propose and design a minimum of 3 visualizations that would answer the research questions proposed in Part 1 (creativity will be rewarded).

By design we mean drawing/sketching a prototype. Design can be handdrawn on paper or using a tool of your choice e.g. PowerPoint, Sketch,Illustrator, D3, Tableau, etc. [9 marks]

b.Each visualization should be accompanied by a maximum of 300 words describing the design rationale, which question(s) your design would help answer and if/how your design may improve upon existing examples.

By design rationale we mean: the process and principles followed in choosing the specific visualization. You should provide a rigorous rationale for your design decisions, e.g. visual encodings used and why they are appropriate for the data. These decisions include the choice of visualization type, size, colour, scale, mark and channels and other visual elements, as well as the use of sorting or other data transformations.

Consider how these decisions facilitate analysis and/or communication. [11 marks]

Note: In this part we are only asking to design possible visual layouts not to implement them. If you are hand-drawing your designs take pictures and add them to your document as figures. If you are using a tool to develop your designs save them as images and add them to your document.

Part 3. Implementation [30 marks]

Out of the visualizations proposed in Part 2 implement one in D3 as a webapp. You shall use the data (part or whole of it) provided for your track, you can also complement the data with other data sources you foraged yourself.

Your visualization shall support answering one of the research questions,therefore:

  • it shall be accompanied by a short description of how data are being processed (and acknowledgement of your data source(s)). [10 marks]
  • it can include either a composition of linked/related simple visual layouts or a more sophisticated single visual layout. [10 marks]
  • it shall allow some level of user interaction. [10 marks]

Note: You are allowed to use D3 Example code available on the web as long as it is adapted to your data and you explicitly acknowledge the source of the original code.

The implementation is to be done using D3, visualizations done with other software and embedded in an html page will not be accepted.