大数据代写|ITS70204 Big Data Management Assignment 3 – Individual

这是一篇关于大数据代写的相关作业,具体需要选择谷歌的大数据项目之一,并研究近年来发表的选定项目的相关学术文章(至少3篇文章,不超过5年)。需要调查项目、数据来源、数据类型、技术和谷歌使用/用来管理和处理大数据的任何相关大数据工具。此外,您必须调查所选项目所涉及的现有数据安全、数据隐私、伦理问题和挑战。根据您对所选谷歌大数据项目的调查结果,您需要编写一份报告来满足以下需求。

 

Use Case: Big Data Management in Google

Google has not only significantly influenced the way we can now analyse big data. But they are probably more responsible than anyone else for making it part of our everyday lives.

Many of the innovations done by Google until now, most companies will do in years to come.

Many people, particularly those who didn’t get online until this century had started, will have had their first direct experience of manipulating big data through Google. Although these days Google’s big data innovation goes well beyond basic search, it’s still their core business. They process 3.5 billion requests per day, and each request queries a database of 20 billion web pages (Marr, 2015). Google has done and still is doing many big data projects and each of them collects data from various sources in very large scale and uses one or multiple big data management solutions to store, process, and retrieve data and extract useful information for different purposes.

Task description:

You are required to select one of the Google’s big data projects and study relevant scholastic articles to the selected project that are published in recent years (at least 3 articles and not older than 5 years). You need to investigate the project, sources of data, types of data,technologies and any relevant big data tools that Google uses/used to manage and process big data. Moreover, you must investigate existing data security, data privacy, and ethical issues and challenges that are involved with the selected project. Based on the findings of your investigation on the selected Google big data project, you need to write a report to address the following requirements.

  1. Propose a database solution using structured, semi-structured, or unstructured models or a combination of them to store and manage the data that is involved in the selected Google project. Your report must cover the following items:

1.1. The proposed solution must be able to store and manage all possible sources and types of big data that are used in the selected Google project. Discuss relevant examples of big data records and cite appropriate evidence from recently published articles.

(10 marks)

1.2. The architecture of the proposed solution must be drawn and discussed properly.

(10 marks)

1.3. A comparison between your proposed database solution and the database system(s) that is/are used by Google to manage the same type of big data, must be included and advantages/disadvantages of them must be discussed.

(10 marks)

  1. Investigate and discuss the rules and policies that your proposed solution must establish to address data ethics, data privacy, and data security requirements. You are required to propose at least THREE (3) rules/policies for each of them (data ethics, data privacy, and data security). Each rule/policy must be supported by scholastic articles and proper citation needs to be added.

(30 marks)

Note: Total: 60 Marks (It will be capped at 30 marks)

Deliverables

The output should be in terms of:

  1. Assignment Report (Softcopy in PDF)
  2. Cover page
  3. References. (APA Referencing Style: www.apa.org or

http://www.apastyle.org/index.aspx or

https://owl.english.purdue.edu/owl/resource/560/01/)

  1. Report of similarity (maximum accepted similarity is 20%)

References:

Marr, B. (2015). Big Data: Using SMART big data, analytics and metrics to make better decisions and improve performance. John Wiley \& Sons.