Skip to content

Commit 9e28997

Browse files
committed
Update Airoboros. Test look on preview site.
1 parent ce37484 commit 9e28997

File tree

1 file changed

+32
-32
lines changed

1 file changed

+32
-32
lines changed

airoboros.yaml

Lines changed: 32 additions & 32 deletions
Original file line numberDiff line numberDiff line change
@@ -14,87 +14,87 @@ system:
1414
link: https://huggingface.co/jondurbin/airoboros-dpo-110b-3.3
1515
type: text
1616
performanceclass: full
17-
basemodelname: Airoboros-110B-3.3
17+
basemodelname: Qwen1.5-110B
1818
endmodelname: Airoboros-DPO-110B-3.3
1919
endmodellicense: Purposely left ambiguous
20-
releasedate: 2023-07
21-
notes:
20+
releasedate: 2024-05
21+
notes: Experimental model tuned primarily from synthetic data generated by the airoboros implementation of LLM self-instruction.
2222

2323
org:
2424
name: Jon Durbin
2525
link: https://github.com/jondurbin
26-
notes: Only active on GitHub since May 2023
26+
notes: Primary contributor to chutes.ai.
2727

2828
# availability:
2929
datasources_basemodel:
3030
class: closed
3131
link:
32-
notes: Llama2 training data is nowhere documented or disclosed
32+
notes: Base model data sources nowhere documented or specified.
3333

3434
datasources_endmodel:
35-
class: open
36-
link: https://github.com/jondurbin/airoboros
37-
notes: Airoboros, an implementation of the Self-Instruct paper
35+
class: partial
36+
link: https://huggingface.co/datasets/jondurbin/airoboros-3.2, https://huggingface.co/datasets/bluemoon-fandom-1-1-rp-cleaned, https://huggingface.co/datasets/boolq, https://huggingface.co/datasets/jondurbin/gutenberg-dpo-v0.1, https://huggingface.co/datasets/LDJnr/Capybara, https://huggingface.co/datasets/jondurbin/cinematika-v0.1, https://huggingface.co/datasets/glaiveai/glaive-function-calling-v2, https://huggingface.co/datasets/grimulkan/LimaRP-augmented, https://huggingface.co/datasets/piqa, https://huggingface.co/datasets/Vezora/Tested-22k-Python-Alpaca, https://huggingface.co/datasets/mattpscott/airoboros-summarization, https://huggingface.co/datasets/unalignment/toxic-dpo-v0.2
37+
notes: Most data generated by airoboros, an implementation of the Self-Instruct paper. Many other data sources linked. One data source links to a 404 page. Additional data sources linked in tags but not outlined in model card.
3838

3939
weights_basemodel:
40-
class: partial
41-
link:
42-
notes: Llama2, made conditionally available by Meta
40+
class: open
41+
link: https://huggingface.co/Qwen/Qwen1.5-110B
42+
notes: Weights available through HuggingFace.
4343

4444
weights_endmodel:
4545
class: open
46-
link: https://huggingface.co/jondurbin/airoboros-l2-70b-gpt4-1.4.1/tree/main
47-
notes: Made available through HuggingFace
46+
link: https://huggingface.co/jondurbin/airoboros-dpo-110b-3.3
47+
notes: Weights available through HuggingFace.
4848

4949
trainingcode:
5050
class: partial
51-
link: https://gist.github.com/jondurbin/87fc040b92a3073125ed516b04bc6e19
52-
notes: Repo exists for RL data but only a gist exists for model training and architecture
51+
link: https://github.com/jondurbin/qlora
52+
notes: Repo exists only for general fine-tuning implementation.
5353

5454
# documentation:
5555
code:
5656
class: partial
57-
link: https://github.com/jondurbin/airoboros/tree/main/airoboros
58-
notes: What little code available is not very systematically documented
57+
link: https://github.com/jondurbin/qlora
58+
notes: Code is not very systematically documented.
5959

6060
architecture:
61-
class: partial
62-
link: https://huggingface.co/jondurbin/airoboros-l2-70b-gpt4-1.4.1/discussions/2#64c29e4c617b36543dedac9a
63-
notes: Some info can be gleaned at link but most remains undocumented
61+
class: closed
62+
link:
63+
notes: Earlier airoboros model versions included architecture information. Latest versions (>3.1.2) do not.
6464

6565
preprint:
6666
class: closed
6767
link:
68-
notes: No preprint found
68+
notes: No preprint found.
6969

7070
paper:
7171
class: closed
7272
link:
73-
notes: No peer-reviewed paper found
73+
notes: No peer-reviewed paper found.
7474

7575
modelcard:
7676
class: partial
77-
link: https://huggingface.co/jondurbin/airoboros-65b-gpt4-1.4
78-
notes: Instructs reader to look up model card for prior 65B Llama1 version
77+
link: https://huggingface.co/jondurbin/airoboros-dpo-110b-3.3
78+
notes: Model card contains some information, mainly relating to inference and licensing.
7979

8080
datasheet:
81-
class: partial
82-
link: https://huggingface.co/datasets/jondurbin/airoboros-gpt4-1.4.1
83-
notes: Datasheet for RL data only
81+
class: closed
82+
link: https://huggingface.co/datasets/jondurbin/airoboros-3.2, https://huggingface.co/datasets/bluemoon-fandom-1-1-rp-cleaned, https://huggingface.co/datasets/boolq, https://huggingface.co/datasets/jondurbin/gutenberg-dpo-v0.1, https://huggingface.co/datasets/LDJnr/Capybara, https://huggingface.co/datasets/jondurbin/cinematika-v0.1, https://huggingface.co/datasets/glaiveai/glaive-function-calling-v2, https://huggingface.co/datasets/grimulkan/LimaRP-augmented, https://huggingface.co/datasets/piqa, https://huggingface.co/datasets/Vezora/Tested-22k-Python-Alpaca, https://huggingface.co/datasets/mattpscott/airoboros-summarization, https://huggingface.co/datasets/unalignment/toxic-dpo-v0.2
83+
notes: Datasheets contain differing levels of documentation. Exact data used as well as data collection and curation procedure unknown.
8484

8585
# access:
8686
package:
87-
class: open
87+
class: partial
8888
link: https://pypi.org/project/airoboros/
89-
notes: Pip package available.
89+
notes: Pip package available, however only supports older versions of airoboros.
9090

9191
api:
9292
class: closed
9393
link:
94-
notes: No API found
94+
notes: No API found.
9595
metaprompt: closed
9696

9797
licenses:
9898
class: partial
99-
link: https://huggingface.co/jondurbin/airoboros-l2-70b-gpt4-1.4.1#licence-and-usage-restrictions
100-
notes: Licensing left ambiguous because of murky status of OpenAI-derived Self-Instruct data
99+
link: https://huggingface.co/jondurbin/airoboros-dpo-110b-3.3#licence-and-usage-restrictions
100+
notes: Licensing left ambiguous because of murky status of OpenAI-derived Self-Instruct data.

0 commit comments

Comments
 (0)