-
Notifications
You must be signed in to change notification settings - Fork 6
renamed extract_headline to extract_article, reduced the function cal… #18
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
…l for downloading the article to 1
try: | ||
self.article.download() | ||
self.article.parse() | ||
return self.article |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There is no need to return the article since only the connection adapters need to be downloaded and parsed. Returning does not do anything since the article is already defined in self.article
.
self.article.download() | ||
self.article.parse() | ||
return self.article | ||
except newspaper.article.ArticleException: # List possible errors in case of any exception |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
For this issue; #17
We could also simply say;
except newspaper.article.ArticleException as e:
print(e)
self.neg = [] # Variable to store all negative tokens from negative_headlines.csv file | ||
self.article = newspaper.Article(self.news_url) | ||
|
||
# extract headline |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
# extract article
|
||
# Initialisations | ||
def __init__(self): | ||
self.news_url = input("\nEnter The URL : ") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We should consider working with config files, requirement files and readme.md's from now on. This improves readability, efficiency and minimizes the chance someone fricks with the code.
|
||
# main of class | ||
def main(self): | ||
article = self.extract_article() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why would you define article here if self.article is already initiated?
There are 2 ways around this;
- Instead of
article = self.extract_article()
sayself.extract_article()
- In
def __init__
makeself.article = self.extract_article()
where the entired article is initiated, downloaded and parsed. Here you would want the return.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I prefer 2
…l for downloading the article to 1