@@ -14,87 +14,87 @@ system:
1414 link : https://huggingface.co/jondurbin/airoboros-dpo-110b-3.3
1515 type : text
1616 performanceclass : full
17- basemodelname : Airoboros -110B-3.3
17+ basemodelname : Qwen1.5 -110B
1818 endmodelname : Airoboros-DPO-110B-3.3
1919 endmodellicense : Purposely left ambiguous
20- releasedate : 2023-07
21- notes :
20+ releasedate : 2024-05
21+ notes : Experimental model tuned primarily from synthetic data generated by the airoboros implementation of LLM self-instruction.
2222
2323org :
2424 name : Jon Durbin
2525 link : https://github.com/jondurbin
26- notes : Only active on GitHub since May 2023
26+ notes : Primary contributor to chutes.ai.
2727
2828# availability:
2929datasources_basemodel :
3030 class : closed
3131 link :
32- notes : Llama2 training data is nowhere documented or disclosed
32+ notes : Base model data sources nowhere documented or specified.
3333
3434datasources_endmodel :
35- class : open
36- link : https://github.com/ jondurbin/airoboros
37- notes : Airoboros , an implementation of the Self-Instruct paper
35+ class : partial
36+ link : https://huggingface.co/datasets/ jondurbin/airoboros-3.2, https://huggingface.co/datasets/bluemoon-fandom-1-1-rp-cleaned, https://huggingface.co/datasets/boolq, https://huggingface.co/datasets/jondurbin/gutenberg-dpo-v0.1, https://huggingface.co/datasets/LDJnr/Capybara, https://huggingface.co/datasets/jondurbin/cinematika-v0.1, https://huggingface.co/datasets/glaiveai/glaive-function-calling-v2, https://huggingface.co/datasets/grimulkan/LimaRP-augmented, https://huggingface.co/datasets/piqa, https://huggingface.co/datasets/Vezora/Tested-22k-Python-Alpaca, https://huggingface.co/datasets/mattpscott/airoboros-summarization, https://huggingface.co/datasets/unalignment/toxic-dpo-v0.2
37+ notes : Most data generated by airoboros , an implementation of the Self-Instruct paper. Many other data sources linked. One data source links to a 404 page. Additional data sources linked in tags but not outlined in model card.
3838
3939weights_basemodel :
40- class : partial
41- link :
42- notes : Llama2, made conditionally available by Meta
40+ class : open
41+ link : https://huggingface.co/Qwen/Qwen1.5-110B
42+ notes : Weights available through HuggingFace.
4343
4444weights_endmodel :
4545 class : open
46- link : https://huggingface.co/jondurbin/airoboros-l2-70b-gpt4-1.4.1/tree/main
47- notes : Made available through HuggingFace
46+ link : https://huggingface.co/jondurbin/airoboros-dpo-110b-3.3
47+ notes : Weights available through HuggingFace.
4848
4949trainingcode :
5050 class : partial
51- link : https://gist. github.com/jondurbin/87fc040b92a3073125ed516b04bc6e19
52- notes : Repo exists for RL data but only a gist exists for model training and architecture
51+ link : https://github.com/jondurbin/qlora
52+ notes : Repo exists only for general fine-tuning implementation.
5353
5454# documentation:
5555code :
5656 class : partial
57- link : https://github.com/jondurbin/airoboros/tree/main/airoboros
58- notes : What little code available is not very systematically documented
57+ link : https://github.com/jondurbin/qlora
58+ notes : Code is not very systematically documented.
5959
6060architecture :
61- class : partial
62- link : https://huggingface.co/jondurbin/airoboros-l2-70b-gpt4-1.4.1/discussions/2#64c29e4c617b36543dedac9a
63- notes : Some info can be gleaned at link but most remains undocumented
61+ class : closed
62+ link :
63+ notes : Earlier airoboros model versions included architecture information. Latest versions (>3.1.2) do not.
6464
6565preprint :
6666 class : closed
6767 link :
68- notes : No preprint found
68+ notes : No preprint found.
6969
7070paper :
7171 class : closed
7272 link :
73- notes : No peer-reviewed paper found
73+ notes : No peer-reviewed paper found.
7474
7575modelcard :
7676 class : partial
77- link : https://huggingface.co/jondurbin/airoboros-65b-gpt4-1.4
78- notes : Instructs reader to look up model card for prior 65B Llama1 version
77+ link : https://huggingface.co/jondurbin/airoboros-dpo-110b-3.3
78+ notes : Model card contains some information, mainly relating to inference and licensing.
7979
8080datasheet :
81- class : partial
82- link : https://huggingface.co/datasets/jondurbin/airoboros-gpt4-1.4.1
83- notes : Datasheet for RL data only
81+ class : closed
82+ link : https://huggingface.co/datasets/jondurbin/airoboros-3.2, https://huggingface.co/datasets/bluemoon-fandom-1-1-rp-cleaned, https://huggingface.co/datasets/boolq, https://huggingface.co/datasets/jondurbin/gutenberg-dpo-v0.1, https://huggingface.co/datasets/LDJnr/Capybara, https://huggingface.co/datasets/jondurbin/cinematika-v0.1, https://huggingface.co/datasets/glaiveai/glaive-function-calling-v2, https://huggingface.co/datasets/grimulkan/LimaRP-augmented, https://huggingface.co/datasets/piqa, https://huggingface.co/datasets/Vezora/Tested-22k-Python-Alpaca, https://huggingface.co/datasets/mattpscott/airoboros-summarization, https://huggingface.co/datasets/unalignment/toxic-dpo-v0.2
83+ notes : Datasheets contain differing levels of documentation. Exact data used as well as data collection and curation procedure unknown.
8484
8585# access:
8686package :
87- class : open
87+ class : partial
8888 link : https://pypi.org/project/airoboros/
89- notes : Pip package available.
89+ notes : Pip package available, however only supports older versions of airoboros .
9090
9191api :
9292 class : closed
9393 link :
94- notes : No API found
94+ notes : No API found.
9595 metaprompt : closed
9696
9797licenses :
9898 class : partial
99- link : https://huggingface.co/jondurbin/airoboros-l2-70b-gpt4-1.4.1 #licence-and-usage-restrictions
100- notes : Licensing left ambiguous because of murky status of OpenAI-derived Self-Instruct data
99+ link : https://huggingface.co/jondurbin/airoboros-dpo-110b-3.3 #licence-and-usage-restrictions
100+ notes : Licensing left ambiguous because of murky status of OpenAI-derived Self-Instruct data.
0 commit comments