Pull requests: karpathy/nanochat
fix: guard target-param-data-ratio against zero to avoid ZeroDivision…
#579
opened Feb 28, 2026 by
suraj-self
Fix role validation error message to match allowed roles
#577
opened Feb 28, 2026 by
dhunganapramod9
Validate --source so invalid values don’t raise KeyError
#576
opened Feb 28, 2026 by
dhunganapramod9
Fix SSE parsing when model output contains "data: "
#575
opened Feb 28, 2026 by
dhunganapramod9
fix(tokenizer): Avoid IndexError when conversation has only a system message
#574
opened Feb 28, 2026 by
dhunganapramod9
fix(chat_web): validate --num-gpus against available GPUs for clear s…
#573
opened Feb 28, 2026 by
dhunganapramod9
Remove inherited parameters from scripts
code robustness
scripts
Edits in the bash scripts
suggest/merge
#568
opened Feb 26, 2026 by
svlandeg
fix: add retry with exponential backoff to download_file_with_lock
code robustness
#555
opened Feb 22, 2026 by
icenfly
fix: clamp few-shot sample size to prevent ValueError in core_eval
code robustness
suggest/review
#551
opened Feb 21, 2026 by
suraj-self
dataset: pin validation shard
bug
Something isn't working
suggest/merge
#545
opened Feb 19, 2026 by
KartikVashishta
dataloader: reuse cropped remainders to reduce token waste ~35% → ~23%
improvement
waiting
Waiting for user feedback/action
#544
opened Feb 19, 2026 by
handsomesun
Improve error handling for batch size alignment
suggest/merge
UX
#533
opened Feb 15, 2026 by
suraj-self
Speed up CORE metric evaluation with batched GPU inference and cross-call caching
feature
New feature or request
#526
opened Feb 13, 2026 by
AutomaticHourglass
feat: Enhanced platform detection (ARM64, RISC-V, device models)
feature
New feature or request
#522
opened Feb 11, 2026 by
RalphBigBear
docs: Add Q-Lite deployment guide + edge device integration
docs
Improvements or additions to documentation
#521
opened Feb 11, 2026 by
RalphBigBear
Fix RoPE cache overflow for long prompts with KV cache
code robustness
suggest/review
#520
opened Feb 10, 2026 by
dipeshbabu
Fix hellaswag memory leak and extract cleanup config
improvement
waiting
Waiting for user feedback/action
#519
opened Feb 9, 2026 by
haltingstate
4 tasks
use 1-sqrt warmdown shape for LR schedule
potential_improvement
#513
opened Feb 8, 2026 by
spjosyula
add multi-node training script for distributed training setup
feature
New feature or request
scripts
Edits in the bash scripts
#511
opened Feb 6, 2026 by
kunwar-vikrant
Change default sliding window pattern to the recommended "L" when FA3 is not available
suggest/merge
UX
#509
opened Feb 6, 2026 by
ddudek
LRU Cache for _get_sliding_window_mask and convert tensors once in dataloader
improvement
suggest/review
#501
opened Feb 4, 2026 by
Narden91