
Goodreads. The dataset is downloaded from Kaggle

Resource Information

In this assignment, you will work with the books.csv file. This file contains detailed information about books scraped from Goodreads. The dataset was downloaded from the Kaggle website: https://www.kaggle.com/jealousleopard/goodreadsbooks/downloads/goodreadsbooks.zip/6
Each row in the file has ten columns. A description of each column follows:
  1. bookID: A unique Identification number for each book.
  2. title: The name under which the book was published.
  3. authors: Names of the authors of the book. Multiple authors are delimited with -.
  4. average_rating: The average of all ratings the book has received.
  5. isbn: Another unique number to identify the book, the International Standard Book Number.
  6. isbn13: A 13-digit ISBN to identify the book, instead of the standard 10-digit ISBN.
  7. language_code: The primary language of the book.
  8. num_pages: Number of pages the book contains.
  9. ratings_count: Total number of ratings the book received.
  10. text_reviews_count: Total number of written text reviews the book received.
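
To make the column layout concrete, here is a minimal sketch that loads two made-up rows in the ten-column format described above. The sample values are hypothetical and are not taken from the real books.csv. The sketch assumes you want to keep the ISBN columns as text (so leading zeros survive) and shows one way to split the authors field on its - delimiter.

```python
import io

import pandas as pd

# Two made-up rows in the ten-column layout described above
# (hypothetical values, NOT taken from the real books.csv).
SAMPLE_CSV = """bookID,title,authors,average_rating,isbn,isbn13,language_code,num_pages,ratings_count,text_reviews_count
1,Example Book One,Author A-Author B,4.10,0123456789,9780123456786,eng,320,1500,120
2,Example Book Two,Author C,3.75,0987654321,9780987654328,eng,210,800,45
"""

books = pd.read_csv(
    io.StringIO(SAMPLE_CSV),
    index_col="bookID",                  # bookID becomes the row index
    dtype={"isbn": str, "isbn13": str},  # keep ISBNs as text, not numbers
)

# Split the "-"-delimited authors field into a list of names.
# Assumption: no author name itself contains a hyphen.
books["author_list"] = books["authors"].str.split("-")

print(books.dtypes)
print(books[["authors", "author_list"]])
```

With the real file, the same read_csv call would take the path to books.csv in place of the io.StringIO object. Note that splitting on - will also break apart any hyphenated author names, so treat the resulting list as approximate.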

Task

  1. Write code to do the following:
    1. Use pandas to read the file as a dataframe (named books). The bookID column should be the index of the dataframe.
    2. Use books.head() to see the first 5 rows of the dataframe.
    3. Use books.shape to find the number of rows and columns in the dataframe.
    4. Use books.describe() to summarize the data.
    5. Use books['authors'].describe() to find the number of unique authors in the dataset and the most frequent author.
    6. Use OLS regression to test whether the average rating of a book depends on the number of pages, the number of ratings, and the total number of written text reviews the book received.
  2. Summarize your findings in a Word file.
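
The coding steps above can be sketched roughly as follows. This is an illustration under stated assumptions, not a model answer: it runs on a small made-up sample (SAMPLE_CSV is hypothetical and holds only the columns the steps need) so that it is self-contained, and it fits the regression with NumPy's least-squares solver to keep dependencies small. A statistics package such as statsmodels is the more common choice for OLS and would also report p-values.

```python
import io

import numpy as np
import pandas as pd

# Hypothetical stand-in for books.csv (made-up values, fewer columns).
SAMPLE_CSV = """bookID,title,authors,average_rating,num_pages,ratings_count,text_reviews_count
1,Book A,Author X,4.20,320,1500,120
2,Book B,Author X,3.90,210,800,45
3,Book C,Author Y,4.05,450,2300,310
4,Book D,Author Z,3.60,150,400,12
5,Book E,Author X-Author Y,4.40,600,5200,640
6,Book F,Author W,3.80,280,950,80
7,Book G,Author Y,4.10,390,1800,95
8,Book H,Author X,3.70,180,300,30
"""

# Step 1: read the data with bookID as the index.
books = pd.read_csv(io.StringIO(SAMPLE_CSV), index_col="bookID")

print(books.head())      # Step 2: first 5 rows
print(books.shape)       # Step 3: (rows, columns)
print(books.describe())  # Step 4: summary of the numeric columns

# Step 5: count, unique, top (most frequent) and freq for authors.
author_summary = books["authors"].describe()
print(author_summary)

# Step 6: OLS of average_rating on the three count variables.
predictors = ["num_pages", "ratings_count", "text_reviews_count"]
X = np.column_stack(
    [np.ones(len(books)), books[predictors].to_numpy(dtype=float)]
)  # first column of ones is the intercept
y = books["average_rating"].to_numpy(dtype=float)

coef, *_ = np.linalg.lstsq(X, y, rcond=None)
residuals = y - X @ coef
dof = len(y) - X.shape[1]                 # residual degrees of freedom
sigma2 = residuals @ residuals / dof      # residual variance estimate
std_err = np.sqrt(np.diag(sigma2 * np.linalg.inv(X.T @ X)))
t_stats = coef / std_err
centered = y - y.mean()
r_squared = 1 - (residuals @ residuals) / (centered @ centered)

ols_table = pd.DataFrame(
    {"coef": coef, "std_err": std_err, "t": t_stats},
    index=["const"] + predictors,
)
print(ols_table)
print("R-squared:", r_squared)
```

With the real file, replace the io.StringIO object with the path to books.csv. A predictor whose t statistic is large in absolute value (roughly above 2 for a large sample) is evidence that average rating depends on it. With only eight made-up rows, the numbers printed here carry no real meaning.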

Instructions

 Please follow these directions carefully.

  1. Please type your code in a Jupyter Notebook file and your summary in a Word document, both named as follows:
    HW6YourFirstNameYourLastName.
