November 25, 2023

SQL Assignment Help | User-centric Systems for Data Science Assignment 2

This US coursework example is a query-related SQL assignment implemented in Python.

In this assignment you will extend the operator library you built for Assignment 1 to support transparent
provenance tracking. The resulting library will enable users to retrieve various types of
provenance-related information for individual tuples, such as lineage, Where- and How-provenance. The
last assignment task focuses on the concept of data responsibility, which we will discuss in Lecture 6.
For this and future assignments, you will need to have Python 3.7+, Pytest, and Ray installed on your
machine (cf. Section 10 “Resources” for more information).

You must follow the code skeleton provided in the Gitlab repository. Inline comments will help you
identify the parts of the code you need to fill in. Keep in mind that the assignment does not require writing
much code: the logic of each data operator can be implemented in fewer than 20 lines of code. Always keep your
code simple and well documented.

We will be using a mix of real and synthetic data. Real data include movie ratings from a large Netflix
dataset whereas friendship relationships between users are synthetic. The input data are available in the
Gitlab repository. Make sure you understand the data format first (cf. Section 1 “Data schema”). You
might also want to create a toy dataset of the same format to test your code easily.

1. Data schema

The data we will use for this assignment consist of two CSV files: Friends and Ratings. The former
contains friendship relationships as tuples of the form UserID1 UserID2, denoting two users who are
friends. A user can have one or more friends, and the friendship relationship is symmetric: if A is a
friend of B, then B is also a friend of A and both tuples (A B and B A) are present in the file. Ratings
contains user ratings as tuples of the form UserID MovieID Rating. For example, tuple 12 3 4
means that “the user with ID 12 gave 4 stars to the movie with ID 3”.

Hint #1: You can use Python’s CSV reader to parse input files.

Hint #2: Consider encapsulating your Python tuples in ATuple objects (see code skeleton).
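
The two hints can be combined as in the following minimal sketch. The ATuple class here is a hypothetical stand-in (the class in the code skeleton may have a different constructor), read_tuples is an illustrative helper name, and both the space delimiter and the all-integer fields are assumptions based on the example tuple 12 3 4; adjust them to the actual files.

```python
import csv
import io
from typing import Iterable, List, Tuple


class ATuple:
    """Hypothetical stand-in for the skeleton's ATuple; the real class may differ."""

    def __init__(self, values: Tuple[int, ...]):
        self.values = values

    def __repr__(self) -> str:
        return f"ATuple{self.values}"


def read_tuples(lines: Iterable[str], delimiter: str = " ") -> List[ATuple]:
    """Parse delimited rows of integers into ATuple objects, skipping blank rows."""
    reader = csv.reader(lines, delimiter=delimiter)
    return [ATuple(tuple(int(field) for field in row)) for row in reader if row]


# In-memory toy data in the Ratings format: UserID MovieID Rating.
toy_ratings = io.StringIO("12 3 4\n7 3 5\n")
ratings = read_tuples(toy_ratings)
```

To read a real file, an open file handle can be passed in place of the StringIO object, for example inside a with open(path, newline="") block.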

2. TASK I: Implement backward tracing (credits: 40/100)

The first task is to extend the operators you built in Assignment 1 with support for backward tracing. For
each operator, you will have to implement a new method (in Python 3 syntax):

lineage(tuples: List[ATuple]) -> List[List[ATuple]]

that returns the lineage of the given list of tuples.
As discussed in Lecture 2, the lineage of an output tuple t with respect to a query q(D) is the
collection of input tuples that contributed to having the tuple t in the output of the query. Let
recommendation be the output (movie ID) of the second query from Assignment 1:

SELECT R.MID
FROM ( SELECT R.MID, AVG(R.Rating) as score
       FROM Friends as F, Ratings as R
       WHERE F.UID2 = R.UID
         AND F.UID1 = 'A'
       GROUP BY R.MID
       ORDER BY score DESC
       LIMIT 1 )
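
To make the definition of lineage concrete, the sketch below evaluates the query by hand on a small, made-up dataset and collects the input tuples behind the result. The data and variable names are hypothetical, and the sketch assumes the reading in which only the Friends and Ratings tuples of the winning group count as lineage; it is not the operator-based implementation the task asks for.

```python
from collections import defaultdict

# Hypothetical toy data; user IDs are strings here to mirror the 'A' in the query.
friends = [("A", "B"), ("B", "A"), ("A", "C"), ("C", "A")]
ratings = [("B", 1, 5), ("C", 1, 4), ("B", 2, 3), ("D", 2, 5)]

# Join Friends with Ratings on F.UID2 = R.UID, keeping only F.UID1 = 'A',
# and remember which input tuples fed each movie's group.
inputs_by_movie = defaultdict(list)
for f in friends:
    if f[0] != "A":
        continue
    for r in ratings:
        if f[1] == r[0]:
            inputs_by_movie[r[1]].append((f, r))

# AVG(R.Rating) per movie, then ORDER BY score DESC LIMIT 1.
scores = {
    mid: sum(r[2] for _, r in pairs) / len(pairs)
    for mid, pairs in inputs_by_movie.items()
}
recommendation = max(scores, key=scores.get)

# Lineage: every Friends and Ratings tuple that contributed to the winning group.
lineage = sorted(
    {t for pair in inputs_by_movie[recommendation] for t in pair}, key=str
)
```

On this toy data, movie 1 wins with an average score of 4.5, and its lineage consists of the Friends tuples A B and A C together with the Ratings tuples B 1 5 and C 1 4. The rating by user D is not part of the lineage because D is not a friend of A.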

To successfully complete this task, you must implement a new method for ATuple:

lineage() -> List[ATuple]

so that you can retrieve the lineage of any recommendation as follows:

lineage = recommendation.lineage()

Calling recommendation.lineage() should internally call:

operator.lineage(tuples=[recommendation])

where operator is a handle to the operator that produced the tuple recommendation (i.e. the root
operator of the query tree).
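
The delegation described above might look like the following sketch. All class names and attributes here (Operator, Scan, Select, values, operator, source, output) are hypothetical and chosen for illustration; the code skeleton defines its own classes, and its operators may pull tuples lazily rather than materializing their output as this example does. Since operator.lineage returns one list per input tuple, ATuple.lineage unwraps the first (and only) list.

```python
from typing import Callable, Dict, List, Optional


class Operator:
    """Hypothetical base class; the skeleton's operators may look different."""

    def lineage(self, tuples: List["ATuple"]) -> List[List["ATuple"]]:
        raise NotImplementedError


class ATuple:
    """Hypothetical stand-in: a tuple that remembers the operator that produced it."""

    def __init__(self, values: tuple, operator: Optional[Operator] = None):
        self.values = values
        self.operator = operator

    def lineage(self) -> List["ATuple"]:
        # Delegate to the producing operator and unwrap the single result list.
        return self.operator.lineage(tuples=[self])[0]


class Scan(Operator):
    """Leaf operator: each scanned tuple is its own lineage."""

    def __init__(self, rows: List[tuple]):
        self.output = [ATuple(row, operator=self) for row in rows]

    def lineage(self, tuples: List[ATuple]) -> List[List[ATuple]]:
        return [[t] for t in tuples]


class Select(Operator):
    """Filter that records, per output tuple, the input tuple it came from."""

    def __init__(self, child: Scan, predicate: Callable[[tuple], bool]):
        self.source: Dict[int, ATuple] = {}  # id(output tuple) -> input tuple
        self.output: List[ATuple] = []
        for t in child.output:
            if predicate(t.values):
                out = ATuple(t.values, operator=self)
                self.source[id(out)] = t
                self.output.append(out)

    def lineage(self, tuples: List[ATuple]) -> List[List[ATuple]]:
        # Trace each output back one step, then let the input continue the trace.
        return [self.source[id(t)].lineage() for t in tuples]


# Usage: keep ratings of at least 4 stars, then trace the surviving tuple.
scan = Scan([(12, 3, 4), (7, 3, 2)])
select = Select(scan, lambda row: row[2] >= 4)
result = select.output[0]
traced = result.lineage()
```

Here traced contains the single scanned tuple (12, 3, 4) that passed the filter. Operators with several inputs per output, such as joins and group-by aggregates, would store a list of contributing input tuples per output and concatenate their lineages.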
