PROBLEM 1

Select frequent words (those whose count is equal to or greater than 50,000).

Display the frequent words in descending order of count.
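
The filter-then-sort logic can be sketched as follows. This is only an illustration in Python, not the expected submission: the SIZE hint in Problem 2 suggests the assignment targets Apache Pig, where the same steps would be a FILTER followed by an ORDER ... DESC. The sample pairs and the names `word_counts` and `frequent_words` are hypothetical, and sorting by count is an assumption about what "descending order" refers to.

```python
# Hypothetical (word, count) pairs standing in for the real word-count data.
word_counts = [("the", 120000), ("pig", 48000), ("of", 95000), ("data", 50000)]

THRESHOLD = 50_000  # "equal to or greater than 50,000" from the problem text


def frequent_words(pairs, threshold=THRESHOLD):
    """Keep pairs with count >= threshold, sorted by count, largest first."""
    kept = [(word, count) for word, count in pairs if count >= threshold]
    # Assumption: "descending order" means descending by count.
    return sorted(kept, key=lambda pair: pair[1], reverse=True)


result = frequent_words(word_counts)
for word, count in result:
    print(word, count)
```

Note that the comparison is inclusive, so a word with exactly 50,000 occurrences is kept.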

PROBLEM 2

Get groups of words by their length (Hint: use the built-in function SIZE) and count each group.

For example,

(2,1096049) means that there are 1096049 occurrences of words that have two characters.
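
A minimal sketch of the grouping step, written in Python for illustration only (in Pig, the hinted SIZE function would supply the group key). It assumes the input is one token per word occurrence; if the data instead holds (word, count) pairs, the counts would need to be summed per length. The sample list `words` and the function name `count_by_length` are hypothetical.

```python
from collections import Counter

# Hypothetical input: one entry per word occurrence.
words = ["to", "be", "or", "not", "to", "be", "that", "is"]


def count_by_length(tokens):
    """Return (length, occurrences) pairs, ordered by word length."""
    totals = Counter(len(token) for token in tokens)
    return sorted(totals.items())


length_counts = count_by_length(words)
for length, occurrences in length_counts:
    print((length, occurrences))
```

Each output pair has the same shape as the (2,1096049) example: the word length first, then the number of occurrences with that length.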

Problem 3 is based on the dataset nyc_taxi_data_2014.csv.gz.

PROBLEM 3

Find the effect of passenger_count on trip_distance, fare_amount, and tip_rate.

a) Create a new data set records2 that has passenger_count, trip_distance, fare_amount, and tip_rate (tip_amount/total_amount).

b) Filter records2 by passenger_count (0 < passenger_count < 10) and name the resulting data set records3.

c) Group records3 by passenger_count. 

d) Display the average trip_distance, average fare_amount, and average tip_rate for each passenger_count group.
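
Steps a) through d) can be sketched end to end as below. This is a Python illustration of the logic under stated assumptions, not the Pig script the assignment presumably expects: the sample rows in `trips` are invented, and skipping rows whose total_amount is zero (to avoid division by zero) is an added assumption that the problem text does not mention.

```python
from collections import defaultdict

# Hypothetical rows using the column names from the problem statement.
trips = [
    {"passenger_count": 1, "trip_distance": 2.0, "fare_amount": 8.0,
     "tip_amount": 2.0, "total_amount": 10.0},
    {"passenger_count": 1, "trip_distance": 4.0, "fare_amount": 14.0,
     "tip_amount": 0.0, "total_amount": 14.0},
    {"passenger_count": 2, "trip_distance": 3.0, "fare_amount": 12.0,
     "tip_amount": 3.0, "total_amount": 15.0},
    {"passenger_count": 0, "trip_distance": 1.0, "fare_amount": 5.0,
     "tip_amount": 0.0, "total_amount": 5.0},
]

# a) records2: (passenger_count, trip_distance, fare_amount, tip_rate).
# Assumption: rows with total_amount == 0 are skipped to avoid dividing by zero.
records2 = [
    (t["passenger_count"], t["trip_distance"], t["fare_amount"],
     t["tip_amount"] / t["total_amount"])
    for t in trips
    if t["total_amount"] > 0
]

# b) records3: keep rows with 0 < passenger_count < 10.
records3 = [rec for rec in records2 if 0 < rec[0] < 10]

# c) Group records3 by passenger_count.
groups = defaultdict(list)
for passenger_count, *measures in records3:
    groups[passenger_count].append(measures)

# d) Average each measure (distance, fare, tip rate) within each group.
averages = {
    passenger_count: tuple(sum(column) / len(column) for column in zip(*rows))
    for passenger_count, rows in sorted(groups.items())
}

for passenger_count, (distance, fare, tip) in averages.items():
    print(passenger_count, distance, fare, tip)
```

Comparing the averages across passenger_count values is then what answers the question about the effect of passenger_count.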
