RDAgent Finetune LLM #1314

XianBW · 2025-12-15T07:05:38Z

Description

Motivation and Context

How Has This Been Tested?

If you are adding a new feature, test on your own test scripts.

Screenshots of Test Results (if appropriate):

Your own tests:

Types of changes

Fix bugs
Add new feature
Update documentation

📚 Documentation preview 📚: https://RDAgent--1314.org.readthedocs.build/en/1314/

* feat: add iterative evolve and evaluation support with partial chain stop * feat: add FTDataEvaluator and support multiple implement functions in finetune

…1303) * feat:(1) support for multi layer dataset extraction (2) add category.json for dataset in datasets/ * fix: fix bug for generate category.json * feat: add get_dataset_folder_desc * init data proposal and merge qzli/ft * update data proposal prompts and add max_position_embeddings and resolve confilcts * remove sample counts in data proposal * turn data and train to unified hypo_gen * refine prompts * remove category.json and add it to dataset_info * fix jinja problem and proposal done * lint * add ai-generated description and raw readme into dataset_info.json * update prompt for description * add datasets * initial fix for proposal of data * final version for data proposal * lint

* refactor(dataset): add stats into dataset_info.json, and remove dataset from gitignore_folder * feat: enable data coder and run data process

* feat: implement finetune data coding, evaluation, and config improvements * fix: deepspeed config path * fix: dataset info columns --------- Co-authored-by: Young <[email protected]>

… description

Jensen246 and others added 30 commits November 22, 2025 21:13

fix lora eval bug

517347c

some update for benchmark

836d187

update feedback

a5b2aca

finalize feedback

c15bcfb

user benchmark as input

0e7dc31

clean benchmark code

4d5d19a

some update to benchmark.py

83eba89

new benchmark file

c293ba0

several small update

7a78046

modify the prompt of coder

a837282

remove oft, which has been obsoleted

55601b7

comment for 2 level dataset instructure

8f83d31

prompts and dockerfile refine for using deepspeed and fa2

0c92d34

lint

b95254c

hot fix

2e7f69a

several major update

3cf9ac9

prompt key refinement

84972cf

refine prompt

dc2e96a

Merge branch 'main' into qzli/ft

eb613cd

small update

7d2b64b

fix a small bug

3574238

remove debug config after execution

2056c0b

fix: only remove <think> at start

0979827

feat: support creating dataset & multi-eval frame (#1302)

1f2ca73

* feat: add iterative evolve and evaluation support with partial chain stop * feat: add FTDataEvaluator and support multiple implement functions in finetune

feat: add stats in dataset_info, and enable data coder (#1306)

e489d9c

* refactor(dataset): add stats into dataset_info.json, and remove dataset from gitignore_folder * feat: enable data coder and run data process

feat: Merge data coder (#1307)

e104f50

* feat: implement finetune data coding, evaluation, and config improvements * fix: deepspeed config path * fix: dataset info columns --------- Co-authored-by: Young <[email protected]>

replace str length with token_limit

5b7dc33

add readme to dataset_info and remove useless blank lines in scenario…

41fc3c5

… description

feat: dataset prepare

a7e2734

Jensen246 force-pushed the finetune branch 3 times, most recently from 2fca13c to 726ad84 Compare January 15, 2026 15:30

feat: sync litellm log

dfdd000

Jensen246 force-pushed the finetune branch from 726ad84 to dfdd000 Compare January 15, 2026 15:33

Jensen246 added 2 commits January 15, 2026 15:55

fix: gpu memory format

f8d9203

fix: escape special characters in benchmark desc

cb8889f

Jensen246 force-pushed the finetune branch from a918195 to cb8889f Compare January 15, 2026 16:31

fix: set data processing timeout to 1h

a8aeadf

Jensen246 force-pushed the finetune branch from f42c193 to a8aeadf Compare January 16, 2026 08:06

Jensen246 added 20 commits January 17, 2026 07:36

feat: set valid_loss and save_best_model

0466818

fix: inject timeout and stage

f227a87

fix: loss history extract logic

26d6561

feat: inject output dir

9301b2a

feat: inject eval batch size

0dd026b

feat: inject save_total_limit

e64a9f9

feat: update data prompt

16e9e24

fix: escape shell special characters

9bea9c4

fix: tablebench visualization UI

fa37212

fix: move implementation validation to coder, and ignore injected params

2d47a11

feat: README for FinanceIQ dataset

80d2c33

fix: bioprobench desc error

fa2b641

fix: remove task alignment when coder eval

a2cb563

fix: FinanceIQ now extracts last capital as answer

acbd3ce

fix: stdout contains binary data

49667d5

feat: recover estimate full output and set eval setting automatically

07b343c

fix(ui): precision for summary table

8abb081

fix(ui): import error

4ed9a5b

feat: try to use lora

40f14a1

fix(api): fix litellm bug for code block

2ef0841

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

RDAgent Finetune LLM #1314

RDAgent Finetune LLM #1314

Uh oh!

XianBW commented Dec 15, 2025 •

edited by github-actions bot

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

Uh oh!

RDAgent Finetune LLM #1314

Are you sure you want to change the base?

RDAgent Finetune LLM #1314

Uh oh!

Conversation

XianBW commented Dec 15, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Motivation and Context

How Has This Been Tested?

Screenshots of Test Results (if appropriate):

Types of changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

XianBW commented Dec 15, 2025 •

edited by github-actions bot

Loading