Final-Project-01205489: News Headlines Dataset For Sarcasm Detection

Final Project 01205489(Principles of Deep Learning and Applications)

This project's purpose is to analyze and classify sarcasm sentences using the dataset from Kaggle The dataset is from Kaggle Dataset.

The project is made of 3 parts; visualize and analyze data, ML model prediction, and Glove pre-train model prediction. For the first part, we clean and preprocess data with stop words, and punctuation and then visualize data with the distribution of length of the word, number of words in the headline, and average word length in the headline to find if that dataset is biased or not. Common words, N-gram analysis, and word cloud is needed for one-word prediction. The second part is the tokenization of words into vectors and trains in ML models. The decision tree and Random Forest show us an interesting result. Lastly, Glove pre-train model by using word embedding gave an incredible result with the long period of time to train For Special Section, compared Glove model with different embedding words and show why I use a combined dataset

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
News Headlines Dataset For Sarcasm Detection.ipynb		News Headlines Dataset For Sarcasm Detection.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Final-Project-01205489: News Headlines Dataset For Sarcasm Detection

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Final-Project-01205489: News Headlines Dataset For Sarcasm Detection

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages