Our Services

Get 15% Discount on your First Order

Small Python Project

December 9, 2023

Important

· Before you start: Please make sure you have a valid NYTimes account and created an API key!!!! Look at the video on how to create a new API key. You can find it on Canvas in Files/Lecture Videos. The file is titled week14_newyorktimes_api.mp4

· The procedure is straightforward: go to
developer.nytimes.com

· and create an account. Create an App and generate an API Key.

· Your final output should have two python files:

· code to get the data from the NYTimes API, cleaning it, retrieving the relevant fields and storing it in files

· code to analyze the data and present the results

Checklist for your code

Required

· Your code executes successfully

· Your code should have detailed comments

Important. You will lose points if you do not complete all the tasks. However, if your code does not run correctly, you will not be getting partial credit. So it is better to make sure you implement the project in such a way that you cover one goal after another.

Part 1: Data collection and cleaning

Prerequisites:

· You will be using the Archive end point of the NYTimes API for the project. Familiarize yourself with the endpoint (also discussed in detail in the lecture video)

· You can go to

·
Links to an external site.

· and read the documentation

· The API endpoint just needs the year, month and the API key. So if you want to get data for 2019 January, you request the url

· Parse the JSON and get the field under response -> docs -> headline -> main which is the title of the article.

· Remember: the JSON you receive from the url is all garbled and unreadable. To make it viewable you either have to google for a json formatter extension for the browser you are using (Chrome and Safari) or use Firefox where json formatting is available by default.

Output:

· Create a file named data_collection.py where you get all the posts from two different points in time: October 1918 (Spanish Flu pandemic) and October 2020 (COVID pandemic). Extract the titles and store them in two different files titled “titles_1918.txt” and “titles_2020.txt”. You should clean the titles and make sure that one title is stored per line in the file, i.e. the files should exactly have the same number of lines as the number of results returned (accessible via response -> meta -> hits).

Part 2: Analysis

Next create another file titled analysis.py in which you should read the two files titles_1918.txt and titles_2020.txt files and perform the following analysis:

1. Count the 10 words that appear most frequently in each of these files and print them in reverse order of frequency for each year. Make sure to remove the commonly occurring words in the English language using the code from the remove_stopwords function provided
here

2.
Links to an external site.

3. . You should call this function for each article title.

4. What fraction of articles did the words “flu”, “virus” and “death” appear in both the years?

5. Count occurrences of dollar amounts in the headlines and produce a total of the dollar amounts mentioned each year.

· For instance, consider three headlines:

1. “$503,200 was spent on fixing roads”

2. “Twitter subscriptions now cost $8”

3. “Covid-19 deaths could cost the economy a trillion dollars”.

· The “dollar amounts” to be considered in these headlines are the ones which have a ‘$’ symbol and a number after it. So we need to output 503200 + 8 = 503208 as output

· Hint: Use a regular expression to identify all the dollar amounts. You could use something like “\$[0-9,]+” to identify all occurrences of dollar amounts.

6. Identifies the average sentiment among these article titles (check out the section titled “Sentiment analysis in python” on Canvas. You should use the function provided in that section. Check out the video
here.

7.
Links to an external site.

8. Even though the video mentions tweets, it should work for article titles. Important:
Make sure you install vaderSentiment on replit!)

Sugar. Bonus points for making use of functions for identifying various components like counting frequency and searching for keywords.

Expected outcome: (below is an example of how you can format your output. It need not be done exactly this way, but the output of each analysis should be clearly indicated). Also note that the exact output you get will be completely different based on the data you used. The numbers shown are only for illustration. You could get different results based on your analysis.

************************

Most frequent words 1918

************************

abc, 204

ssi, 134

kasgas, 122

…

************************

Most frequent words 2020

************************

abc, 204

dead, 124

gdvjs, 102

…

************************

Fraction of articles in 1918

************************

flu 0.085

virus 0.123

death 0.03

************************

Fraction of articles in 2020

************************

flu 0.083

virus 0.14

death 0.001

************************

Dollar amounts

************************

1918 $302,402,425

2020 $204,325,381

************************

Sentiment 1918

************************

The average sentiment of the articles is 0.421

************************

Sentiment 2020

************************

The average sentiment of the articles is 0.421

>Computer Science homework help, PROJECT, Python, small

Share This Post

Order a Similar Paper and get 15% Discount on your First Order

Related Questions

In this discussion, you are going to research Vigenere Cipher. Use the Vigenere Table to encrypt the message using the key ORANGE:

In this discussion, you are going to research Vigenere Cipher. Use the Vigenere Table to encrypt the message using the key ORANGE: Hard work always pays. Now use the same key, to decrypt this message: Oktnio ok dncr Instructions: 1. Ignore all cases and punctuations 2. Do not encrypt or decrypt

Prepare the peer’s presentation topic by finding and reviewing the Internet resources and documents about the topic that you are assigned to

Prepare the peer’s presentation topic by finding and reviewing the Internet resources and documents about the topic that you are assigned to review. Prepare for your feedback and questions that you will raise throughout the presentation. provide feedback to your peer. Ask your questions, critique the presentation, and make

file attached. 1 Week 1 – Discussion: AI in Business Consider what you have read so far in the learning materials for Week 1

file attached. 1 Week 1 – Discussion: AI in Business Consider what you have read so far in the learning materials for Week 1, and write a one-page summary about it. Focus on the use of Artificial Intelligence in Business. The summary must present your understanding in your own words

The Enterprise Architecture Repository is an online, web-based platform designed to store and organize Enterprise Architecture artifacts produced by EA

The Enterprise Architecture Repository is an online, web-based platform designed to store and organize Enterprise Architecture artifacts produced by EA software tools. You work as a developer for CMS and want to protect the security of the organization. You have developed a level of trust with the development team,

Assignment Content 1. As individuals, we influence the world around us. Research ways in which millennials are influencing

Assignment Content 1. As individuals, we influence the world around us. Research ways in which millennials are influencing software engineering and the modern day workplace. Prepare a 2 page response in the APA format. Have a separate page for references. Follow the APA Format! 2. Research what is meant by

Instructions: Q1. Design a program that asks the user to enter a series of 20 numbers. The program should store the numbers in a list then display the

Instructions: Q1. Design a program that asks the user to enter a series of 20 numbers. The program should store the numbers in a list then display the following data: The lowest number in the list The highest number in the list The total of the numbers in the list The

do a cset diagram for Staples print department, Departments and Key Tasks: Print Services Department: In the diagram identify: ● major

do a cset diagram for Staples print department, Departments and Key Tasks: Print Services Department: In the diagram identify: ● major departments and their key tasks/activities/responsibilities ● major systems and databases ● threats/vulnerabilities/risks Tasks: Printing: Handling various types of printing including bulk, specialized, and quick print services. Copying: Providing self-service

IT543-4: Design an implementation of cryptographic methods for an organization. Assignment Instructions: Perform the lab described in the zip

IT543-4: Design an implementation of cryptographic methods for an organization. Assignment Instructions: Perform the lab described in the zip folder Wireshark — Capturing SSL Packets. Follow the directions, perform the indicated instructions, and provide the requested information. Take screenshots of each step to show that you are working through the steps.

Discussion The unit readings outline several challenges of implementing and upgrading enterprise information systems. Employees were

Discussion The unit readings outline several challenges of implementing and upgrading enterprise information systems. Employees were cited as one of the pain points. Why do you think this is? As an MIS leader responsible for implementing or upgrading an enterprise information system, select two or three strategies you would use

You must find a news or journal article that is evidence that the problem or issue you wish to study is something current and of concern. Please

You must find a news or journal article that is evidence that the problem or issue you wish to study is something current and of concern. Please download and use this template. (Save it per the video “How to Save and Submit Assignments, in two formats. Submit the file

test1.txt // Function with Arithmetic Expression function main returns integer; begin 7 + 2 * (5 + 4); end; test2.txt // Function with a

test1.txt // Function with Arithmetic Expression function main returns integer; begin 7 + 2 * (5 + 4); end; test2.txt // Function with a lexical error function main returns integer; begin 7 * 2 $ (2 + 4); end; test3.txt // Punctuation symbols ,;() => // Identifier name name123 //

MAKE A FUNCTIONAL DATABASE APP USING THE PICTURE I POSTED BELOW AS A REFERENCES. BE ABLE TO SAVE THE DATABASE AND ALL. I NEED SOURCE CODES AND JAR FILE.

MAKE A FUNCTIONAL DATABASE APP USING THE PICTURE I POSTED BELOW AS A REFERENCES. BE ABLE TO SAVE THE DATABASE AND ALL. I NEED SOURCE CODES AND JAR FILE. THE REQUIREMENTS ARE BELOW AS WELL.

Assignment Instructions: Research and select 3 different Big Data use cases. Create a digital artifact that details the typical business objectives and

Assignment Instructions: Research and select 3 different Big Data use cases. Create a digital artifact that details the typical business objectives and analytical solution for each use case.

Combine both codes or make it work with 2 separate tabs. I am using Arduino Ethernet Shield W5100 with Arduino UNO trying to send a notification to

Combine both codes or make it work with 2 separate tabs. I am using Arduino Ethernet Shield W5100 with Arduino UNO trying to send a notification to PushSafer. Each code works individually but I cannot get to work together.

Please arrange the files with the proper titles and project numbers 1 Project 2 JOGL OpenGL Project Overview In this project you will create a

Please arrange the files with the proper titles and project numbers 1 Project 2 JOGL OpenGL Project Overview In this project you will create a unique 3 graphics scene composed of OpenGL graphic components using transformation methods. Requirements: 1. Using Netbeans or Eclipse, develop a JOGL application that displays a

1 Homework 4 1. (10 pts) For the following program, explain the interesting elements related to threads. Focus on explaining the output of the

1 Homework 4 1. (10 pts) For the following program, explain the interesting elements related to threads. Focus on explaining the output of the program. 1 public class TaskThreadDemo { 2 public static void main (String args []) { 3 String [] sa = {“a”, “X”, “+”, “.”}; 4 for

1 Project 2 JOGL OpenGL Project Overview In this project you will create a unique 3 graphics scene composed of OpenGL graphic components using

1 Project 2 JOGL OpenGL Project Overview In this project you will create a unique 3 graphics scene composed of OpenGL graphic components using transformation methods. Requirements: 1. Using Netbeans or Eclipse, develop a JOGL application that displays a unique 3D scene. The scene has the following specifications: a. Size:

Project 3 – Ensemble Methods and Unsupervised Learning In this project you will explore some techniques in unsupervised learning as well as ensemble

Project 3 – Ensemble Methods and Unsupervised Learning In this project you will explore some techniques in unsupervised learning as well as ensemble methods. It is important to realize that understanding an algorithm or technique requires understanding how it behaves under a variety of circumstances. You will go through the

Our Services

Small Python Project

Checklist for your code

Share This Post

Related Questions

In this discussion, you are going to research Vigenere Cipher. Use the Vigenere Table to encrypt the message using the key ORANGE:

Prepare the peer’s presentation topic by finding and reviewing the Internet resources and documents about the topic that you are assigned to

file attached. 1 Week 1 – Discussion: AI in Business Consider what you have read so far in the learning materials for Week 1

The Enterprise Architecture Repository is an online, web-based platform designed to store and organize Enterprise Architecture artifacts produced by EA

Assignment Content 1. As individuals, we influence the world around us. Research ways in which millennials are influencing

Instructions: Q1. Design a program that asks the user to enter a series of 20 numbers. The program should store the numbers in a list then display the

do a cset diagram for Staples print department, Departments and Key Tasks: Print Services Department: In the diagram identify: ● major

IT543-4: Design an implementation of cryptographic methods for an organization. Assignment Instructions: Perform the lab described in the zip

Discussion The unit readings outline several challenges of implementing and upgrading enterprise information systems. Employees were

You must find a news or journal article that is evidence that the problem or issue you wish to study is something current and of concern. Please

test1.txt // Function with Arithmetic Expression function main returns integer; begin 7 + 2 * (5 + 4); end; test2.txt // Function with a

MAKE A FUNCTIONAL DATABASE APP USING THE PICTURE I POSTED BELOW AS A REFERENCES. BE ABLE TO SAVE THE DATABASE AND ALL. I NEED SOURCE CODES AND JAR FILE.

Assignment Instructions: Research and select 3 different Big Data use cases. Create a digital artifact that details the typical business objectives and

Combine both codes or make it work with 2 separate tabs. I am using Arduino Ethernet Shield W5100 with Arduino UNO trying to send a notification to

Please arrange the files with the proper titles and project numbers 1 Project 2 JOGL OpenGL Project Overview In this project you will create a

1 Homework 4 1. (10 pts) For the following program, explain the interesting elements related to threads. Focus on explaining the output of the

1 Project 2 JOGL OpenGL Project Overview In this project you will create a unique 3 graphics scene composed of OpenGL graphic components using

Project 3 – Ensemble Methods and Unsupervised Learning In this project you will explore some techniques in unsupervised learning as well as ensemble

Use Our 6 Free Tools

We Accept