Kodezi
diff --git a/‎.github/ISSUE_TEMPLATE/research_question.md‎
Lines changed: 22 additions & 0 deletions b/‎.github/ISSUE_TEMPLATE/research_question.md‎
Lines changed: 22 additions & 0 deletions
diff --git a/‎.github/workflows/quality.yml‎
Lines changed: 42 additions & 0 deletions b/‎.github/workflows/quality.yml‎
Lines changed: 42 additions & 0 deletions
diff --git a/‎.github/workflows/tests.yml‎
Lines changed: 48 additions & 0 deletions b/‎.github/workflows/tests.yml‎
Lines changed: 48 additions & 0 deletions
diff --git a/‎.gitignore‎
Lines changed: 92 additions & 0 deletions b/‎.gitignore‎
Lines changed: 92 additions & 0 deletions
diff --git a/‎CHANGELOG.md‎
Lines changed: 64 additions & 0 deletions b/‎CHANGELOG.md‎
Lines changed: 64 additions & 0 deletions
diff --git a/‎CITATION.cff‎
Lines changed: 51 additions & 0 deletions b/‎CITATION.cff‎
Lines changed: 51 additions & 0 deletions
diff --git a/‎CODE_OF_CONDUCT.md‎
Lines changed: 45 additions & 0 deletions b/‎CODE_OF_CONDUCT.md‎
Lines changed: 45 additions & 0 deletions
@@ -0,0 +1,22 @@
+---
+name: Research Question
+about: Ask questions about the Chronos research or methodology
+title: '[RESEARCH] '
+labels: 'question, research'
+assignees: ''
+---
+
+**Research Topic**
+Which aspect of the research are you asking about? (e.g., AGR mechanism, evaluation methodology, benchmark design)
+
+**Your Question**
+Clearly state your research question or area of confusion.
+
+**Context**
+Have you:
+- [ ] Read the full research paper?
+- [ ] Checked the FAQ?
+- [ ] Searched existing issues?
+
+**Additional Information**
+Any additional context, references, or related work that might be relevant.
@@ -0,0 +1,42 @@
+name: Code Quality
+
+on:
+  push:
+    branches: [ main, develop ]
+  pull_request:
+    branches: [ main ]
+
+jobs:
+  quality:
+    runs-on: ubuntu-latest
+    
+    steps:
+    - uses: actions/checkout@v3
+    
+    - name: Set up Python
+      uses: actions/setup-python@v4
+      with:
+        python-version: '3.10'
+    
+    - name: Install dependencies
+      run: |
+        python -m pip install --upgrade pip
+        pip install black flake8 mypy isort
+        pip install -r requirements.txt
+    
+    - name: Format check with Black
+      run: |
+        black --check benchmarks/ tests/ scripts/
+    
+    - name: Import sort check with isort
+      run: |
+        isort --check-only benchmarks/ tests/ scripts/
+    
+    - name: Lint with flake8
+      run: |
+        flake8 benchmarks/ tests/ scripts/ --count --select=E9,F63,F7,F82 --show-source --statistics
+        flake8 benchmarks/ tests/ scripts/ --count --exit-zero --max-complexity=10 --max-line-length=88 --statistics
+    
+    - name: Type check with mypy
+      run: |
+        mypy benchmarks/ --ignore-missing-imports
@@ -0,0 +1,48 @@
+name: Tests
+
+on:
+  push:
+    branches: [ main, develop ]
+  pull_request:
+    branches: [ main ]
+
+jobs:
+  test:
+    runs-on: ubuntu-latest
+    strategy:
+      matrix:
+        python-version: [3.8, 3.9, '3.10', 3.11]
+
+    steps:
+    - uses: actions/checkout@v3
+    
+    - name: Set up Python ${{ matrix.python-version }}
+      uses: actions/setup-python@v4
+      with:
+        python-version: ${{ matrix.python-version }}
+    
+    - name: Cache dependencies
+      uses: actions/cache@v3
+      with:
+        path: ~/.cache/pip
+        key: ${{ runner.os }}-pip-${{ hashFiles('**/requirements.txt') }}
+        restore-keys: |
+          ${{ runner.os }}-pip-
+    
+    - name: Install dependencies
+      run: |
+        python -m pip install --upgrade pip
+        pip install -r requirements.txt
+        pip install -e .
+    
+    - name: Run tests with pytest
+      run: |
+        pytest tests/ -v --cov=benchmarks --cov-report=xml --cov-report=html
+    
+    - name: Upload coverage to Codecov
+      uses: codecov/codecov-action@v3
+      with:
+        file: ./coverage.xml
+        flags: unittests
+        name: codecov-umbrella
+        fail_ci_if_error: false
@@ -0,0 +1,92 @@
+# Python
+__pycache__/
+*.py[cod]
+*$py.class
+*.so
+.Python
+build/
+develop-eggs/
+dist/
+downloads/
+eggs/
+.eggs/
+lib/
+lib64/
+parts/
+sdist/
+var/
+wheels/
+*.egg-info/
+.installed.cfg
+*.egg
+MANIFEST
+
+# Virtual environments
+venv/
+env/
+ENV/
+.venv
+
+# Jupyter Notebooks
+.ipynb_checkpoints
+*.ipynb_checkpoints/
+
+# IDE
+.vscode/
+.idea/
+*.swp
+*.swo
+*~
+.project
+.pydevproject
+
+# OS
+.DS_Store
+.DS_Store?
+._*
+.Spotlight-V100
+.Trashes
+ehthumbs.db
+Thumbs.db
+
+# Testing
+.coverage
+.pytest_cache/
+htmlcov/
+.tox/
+.nox/
+coverage.xml
+*.cover
+.hypothesis/
+
+# Documentation
+docs/_build/
+site/
+
+# Data and results
+data/raw/
+data/processed/
+results/raw_data/sensitive/
+*.pkl
+*.h5
+*.hdf5
+
+# Logs
+*.log
+logs/
+
+# Environment variables
+.env
+.env.local
+
+# Temporary files
+tmp/
+temp/
+*.tmp
+*.temp
+*.bak
+
+# Model files (proprietary)
+models/
+weights/
+checkpoints/
@@ -0,0 +1,64 @@
+# Changelog
+
+All notable changes to the Kodezi Chronos research repository will be documented in this file.
+
+The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
+and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
+
+## [1.0.0] - 2025-07-14
+
+### Added
+- Initial release of Kodezi Chronos research repository
+- Complete research paper (arXiv:2507.12482)
+- Multi Random Retrieval (MRR) benchmark specification
+- Comprehensive evaluation results and metrics
+- Adaptive Graph-Guided Retrieval (AGR) documentation
+- Architecture overview and design principles
+- Case studies demonstrating debugging capabilities
+- Contribution guidelines and code of conduct
+- FAQ and documentation
+
+### Research Highlights
+- 65.3% debugging success rate (6-7x improvement over GPT-4)
+- 78.4% root cause accuracy
+- 91% retrieval precision with AGR
+- 40% reduction in debugging cycles
+- Successful handling of repository-scale contexts
+
+### Benchmark Results
+- 5,000 real-world debugging scenarios evaluated
+- Statistical significance (p < 0.001) across all metrics
+- Comprehensive ablation studies
+- Performance analysis across bug categories and repo sizes
+
+### Documentation
+- Complete API design documentation
+- Evaluation methodology and protocols
+- Reproduction guidelines for researchers
+- Integration patterns for future deployment
+
+### Known Limitations
+- Lower performance on hardware-specific bugs (23.4%)
+- Challenges with domain-specific logic (28.7%)
+- Limited effectiveness on UI/visual bugs (8.3%)
+
+## Future Releases
+
+### [Planned for Q4 2025]
+- Model release through Kodezi OS platform
+- Additional language support announcements
+- Extended benchmark suite
+- Performance optimizations
+
+### [Planned for Q1 2026]
+- Full Kodezi OS integration
+- Enterprise deployment options
+- Advanced debugging features
+- Cross-repository capabilities
+
+---
+
+For more information about Kodezi Chronos:
+- Research Paper: [arXiv:2507.12482](https://arxiv.org/abs/2507.12482)
+- Model Access: [https://kodezi.com/os](https://kodezi.com/os)
+- Contact: [email protected]
@@ -0,0 +1,51 @@
+cff-version: 1.2.0
+title: "Kodezi Chronos: A Debugging-First Language Model for Repository-Scale, Memory-Driven Code Understanding"
+message: "If you use this research, please cite it as below."
+type: software
+authors:
+  - given-names: Ishraq
+    family-names: Khan
+    email: [email protected]
+    affiliation: Kodezi Inc.
+    orcid: 'https://orcid.org/0000-0000-0000-0000'
+  - given-names: Assad
+    family-names: Chowdary
+    email: [email protected]
+    affiliation: Kodezi Inc.
+  - given-names: Sharoz
+    family-names: Haseeb
+    email: [email protected]
+    affiliation: Kodezi Inc.
+  - given-names: Urvish
+    family-names: Patel
+    email: [email protected]
+    affiliation: Kodezi Inc.
+identifiers:
+  - type: doi
+    value: 10.48550/arXiv.2507.12482
+    description: arXiv preprint
+repository-code: 'https://github.com/kodezi/chronos-research'
+url: 'https://kodezi.com/chronos'
+repository: 'https://arxiv.org/abs/2507.12482'
+abstract: >-
+  Large Language Models (LLMs) have advanced code generation and software automation, 
+  but are fundamentally constrained by limited inference-time context and lack of 
+  explicit code structure reasoning. We introduce Kodezi Chronos, a next-generation 
+  architecture for autonomous code understanding, debugging, and maintenance, designed 
+  to operate across ultra-long contexts comprising entire codebases, histories, and 
+  documentation—all without fixed window limits. Kodezi Chronos leverages a multi-level 
+  embedding memory engine, combining vector and graph-based indexing with continuous 
+  code-aware retrieval. This enables efficient and accurate reasoning over millions 
+  of lines of code, supporting repository-scale comprehension, multi-file refactoring, 
+  and real-time self-healing actions. Chronos achieves 65.3% debugging success rate, 
+  representing a 6-7x improvement over state-of-the-art models.
+keywords:
+  - debugging
+  - language models
+  - code understanding
+  - software engineering
+  - autonomous systems
+  - memory-driven AI
+license: MIT
+version: 1.0.0
+date-released: '2025-07-14'
@@ -0,0 +1,45 @@
+# Code of Conduct - Chronos Research Repository
+
+## Important Notice
+
+This repository contains research materials, benchmarks, and documentation for the Kodezi Chronos debugging model. The model itself is proprietary and available only through Kodezi OS. Learn more at https://chronos.so
+
+## Our Pledge
+
+We as members, contributors, and leaders pledge to make participation in our research community a harassment-free experience for everyone, regardless of age, body size, visible or invisible disability, ethnicity, sex characteristics, gender identity and expression, level of experience, education, socio-economic status, nationality, personal appearance, race, religion, or sexual identity and orientation.
+
+## Our Standards
+
+Examples of behavior that contributes to a positive environment:
+
+* Using welcoming and inclusive language
+* Being respectful of differing viewpoints and experiences
+* Gracefully accepting constructive criticism
+* Focusing on what is best for the community
+* Showing empathy towards other community members
+
+Examples of unacceptable behavior:
+
+* The use of sexualized language or imagery
+* Trolling, insulting or derogatory comments, and personal attacks
+* Public or private harassment
+* Publishing others' private information without permission
+* Other conduct which could reasonably be considered inappropriate
+
+## Enforcement Responsibilities
+
+Community leaders are responsible for clarifying and enforcing our standards of acceptable behavior and will take appropriate and fair corrective action in response to any behavior that they deem inappropriate, threatening, offensive, or harmful.
+
+## Scope
+
+This Code of Conduct applies within all community spaces, and also applies when an individual is officially representing the community in public spaces.
+
+## Enforcement
+
+Instances of abusive, harassing, or otherwise unacceptable behavior may be reported to the community leaders responsible for enforcement at [email protected].
+
+All complaints will be reviewed and investigated promptly and fairly.
+
+## Attribution
+
+This Code of Conduct is adapted from the [Contributor Covenant](https://www.contributor-covenant.org/), version 2.0.