
Goodreads. The dataset is downloaded from Kaggle

Resource Information

In this assignment, you will work with the books.csv file. This file contains detailed information about books scraped from Goodreads. The dataset was downloaded from the Kaggle website: https://www.kaggle.com/jealousleopard/goodreadsbooks/downloads/goodreadsbooks.zip/6
Each row in the file has ten columns. A description of each column follows:
  1. bookID: A unique Identification number for each book.
  2. title: The name under which the book was published.
  3. authors: Names of the authors of the book. Multiple authors are delimited with a hyphen (-).
  4. average_rating: The average of all ratings the book has received.
  5. isbn: Another unique number to identify the book, the International Standard Book Number.
  6. isbn13: A 13-digit ISBN to identify the book, instead of the standard 10-digit ISBN.
  7. language_code: The primary language of the book.
  8. num_pages: Number of pages the book contains.
  9. ratings_count: Total number of ratings the book received.
  10. text_reviews_count: Total number of written text reviews the book received.
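
The column descriptions above suggest a few choices when loading the file. The following is a minimal sketch, not part of the assignment itself: it assumes the column names match the list exactly, uses two made-up rows instead of the real books.csv, and the helper name load_books and the author_list column are hypothetical.

```python
import io

import pandas as pd

# Two made-up rows in the documented column layout (not real dataset values).
SAMPLE_CSV = """bookID,title,authors,average_rating,isbn,isbn13,language_code,num_pages,ratings_count,text_reviews_count
1,Example Book One,Author A-Author B,4.10,0000000001,9780000000001,eng,320,1500,120
2,Example Book Two,Author C,3.75,000000002X,9780000000002,eng,210,400,35
"""


def load_books(source):
    """Read the CSV, keeping ISBNs as text so leading zeros and a trailing 'X' survive."""
    books = pd.read_csv(source, dtype={"isbn": str, "isbn13": str})
    # Assumption: header names may carry stray spaces, so strip them defensively.
    books.columns = books.columns.str.strip()
    # Split the hyphen-delimited authors field into a list per book.
    books["author_list"] = books["authors"].str.split("-")
    return books


books = load_books(io.StringIO(SAMPLE_CSV))
print(books.dtypes)
print(books[["authors", "author_list"]])
```

Splitting on a hyphen is only as reliable as the delimiter: an author whose name itself contains a hyphen would be split in two, so the resulting lists should be spot-checked before counting authors.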

Task

  1. Write code to do the following:
    1. Use pandas to read the file as a dataframe (named books). The bookID column should be the index of the dataframe.
    2. Use books.head() to see the first 5 rows of the dataframe.
    3. Use books.shape to find the number of rows and columns in the dataframe.
    4. Use books.describe() to summarize the data.
    5. Use books['authors'].describe() to find the number of unique authors in the dataset and the most frequent author.
    6. Use OLS regression to test whether the average rating of a book depends on its number of pages, number of ratings, and total number of written text reviews.
  2. Summarize your findings in a Word file.

Instructions

 Please follow these directions carefully.

  1. Please type your code in a Jupyter Notebook file and your summary in a Word document, both named as follows:
    HW6YourFirstNameYourLastName.
