Text Summarization AI with Readability Analysis

A comprehensive text analysis application that combines readability assessment with AI-powered summarization and paraphrasing using state-of-the-art models.

Features

📊 Readability Analysis

Flesch-Kincaid Score: Measures text difficulty (higher = easier)
Gunning Fog Index: Measures complexity (lower = simpler)
SMOG Index: Estimates grade level needed
Sentence Distribution: Classifies sentences as Beginner/Intermediate/Advanced

📝 AI Text Summarization

Multiple Models: PEGASUS, FLAN-T5, and BART
Length Options: Short (~50 words), Medium (~150 words), Long (~300 words)
ROUGE Evaluation: Automatic quality assessment with precision, recall, and F1 scores

🔄 AI Text Paraphrasing

Multiple Models: FLAN-T5 and BART for paraphrasing
Complexity Levels: Simplified, Standard, Enhanced, Academic
Paraphrasing Levels: Sentence, Paragraph, and Full Text
Semantic Similarity: Automatic quality assessment using sentence transformers
Side-by-Side Comparison: Original vs paraphrased text display

📁 File Support

Text Files: TXT, CSV, Excel (XLSX/XLS), PDF
User Management: Registration, login, profile management
File Storage: Secure database storage with metadata

Installation

Option 1: Automatic Installation (Windows)

Double-click install_dependencies.bat
Wait for installation to complete (includes NLTK data download)
Run the application: streamlit run app.py

Option 2: Manual Installation

# Install all required packages
pip install -r requirements.txt

# Download NLTK data
python download_nltk_data.py

# Or install individually
pip install rouge-score accelerate sentence-transformers transformers

Option 3: Using the Application Without AI Features

If you encounter issues with AI dependencies, the application will still work with:

Readability analysis
Text statistics
File upload and storage

Usage

Start the application: streamlit run app.py
Register/Login: Create an account or sign in
Upload a document: Drag and drop TXT, CSV, Excel, or PDF files
Analyze text: Use the tabs to explore different analyses
Generate summaries: Choose model and length for AI summarization
Paraphrase text: Select model, complexity level, and paraphrasing level

Models Used

Summarization Models

PEGASUS: Excellent for news articles and factual content
FLAN-T5: Versatile and accurate for various text types
BART: Great for general text summarization

Paraphrasing Models

FLAN-T5: Good for general paraphrasing with prompt engineering
BART: Good for creative rewording and style changes

Readability Metrics

Flesch-Kincaid: 70+ (easy), 50-70 (moderate), <50 (difficult)
Gunning Fog: ≤8 (simple), 8-12 (moderate), ≥13 (complex)
SMOG Index: ≤8 (elementary), 9-12 (high school), >12 (college)

Complexity Levels for Paraphrasing

Simplified: Uses simpler vocabulary and shorter sentences
Standard: Maintains original complexity level
Enhanced: Uses more sophisticated vocabulary and complex structures
Academic: Formal academic style with advanced terminology

Troubleshooting

Missing Dependencies

If you see "AI features are not available":

Run pip install rouge-score accelerate sentence-transformers
Restart the application

NLTK Data Issues

If you see "Resource punkt_tab not found":

Run python download_nltk_data.py
Or manually run: python -c "import nltk; nltk.download('punkt'); nltk.download('punkt_tab')"
Restart the application

Model Loading Issues

Models are downloaded automatically on first use
Ensure stable internet connection for initial download
GPU acceleration is used if available, falls back to CPU

Technical Details

Framework: Streamlit
Database: SQLite
AI Models: Hugging Face Transformers
Evaluation: ROUGE Score, Sentence Transformers
Authentication: JWT tokens with bcrypt hashing

License

This project is for educational and research purposes.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
__pycache__		__pycache__
sqlite-tools-win-x64-3500400		sqlite-tools-win-x64-3500400
.gitignore		.gitignore
README.md		README.md
app.py		app.py
auth.db		auth.db
download_nltk_data.py		download_nltk_data.py
er.name		er.name
file_uploads.db		file_uploads.db
install_dependencies.bat		install_dependencies.bat
migrate.py		migrate.py
[email protected]		[email protected]
readability_dashboard.py		readability_dashboard.py
requirements.txt		requirements.txt
sqlite-tools-win-x64-3500400.zip		sqlite-tools-win-x64-3500400.zip
sqlite3.exe		sqlite3.exe
users.db		users.db

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Text Summarization AI with Readability Analysis

Features

📊 Readability Analysis

📝 AI Text Summarization

🔄 AI Text Paraphrasing

📁 File Support

Installation

Option 1: Automatic Installation (Windows)

Option 2: Manual Installation

Option 3: Using the Application Without AI Features

Usage

Models Used

Summarization Models

Paraphrasing Models

Readability Metrics

Complexity Levels for Paraphrasing

Troubleshooting

Missing Dependencies

NLTK Data Issues

Model Loading Issues

Technical Details

License

About

Uh oh!

Releases

Packages

Languages

iAmSoundarya/Text-Morph-Advanced-Summarization-And-Paraphrasing-Using-AI

Folders and files

Latest commit

History

Repository files navigation

Text Summarization AI with Readability Analysis

Features

📊 Readability Analysis

📝 AI Text Summarization

🔄 AI Text Paraphrasing

📁 File Support

Installation

Option 1: Automatic Installation (Windows)

Option 2: Manual Installation

Option 3: Using the Application Without AI Features

Usage

Models Used

Summarization Models

Paraphrasing Models

Readability Metrics

Complexity Levels for Paraphrasing

Troubleshooting

Missing Dependencies

NLTK Data Issues

Model Loading Issues

Technical Details

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages