Statistical-Language-Modelling-Using-N-grams

Using Natural Language Processing, the model predicts the most probable next word and outputs the correctness of an input English sentence. To achieve the optimum accuracy, a largereliable dataset or corpus is extracted from Wikipedia, preprocessed, and then analyzed before using it to train the model. Analyzing the dataset and its visualization can be an insightful technique to understand the corpus before using it for the model's training. Choosing an appropriate model for any problem is a crucial step. In our case, using a trigram model to train the data proved to be the best trade-of . This trained model is finally used in thecodeto predict the next word and find the perplexity of a given sentence based on the trigram model.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
Report_Statisitical_Language_Model.pdf		Report_Statisitical_Language_Model.pdf
SourceCode.py		SourceCode.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Statistical-Language-Modelling-Using-N-grams

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Statistical-Language-Modelling-Using-N-grams

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages