Skip to content

Commit 6d53bb8

Browse files
committed
Merge branch 'preview'
2 parents 12b3fd4 + f53d02e commit 6d53bb8

File tree

13 files changed

+147
-148
lines changed

13 files changed

+147
-148
lines changed

Claire.yaml

Lines changed: 10 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -36,9 +36,9 @@ datasources_basemodel:
3636
notes: Based on Falcon
3737

3838
datasources_endmodel:
39-
class: partial
40-
link: https://github.com/bofenghuang/vigogne
41-
notes: edited Alpaca
39+
class: full
40+
link: https://huggingface.co/datasets/OpenLLM-France/Claire-Dialogue-French-0.1
41+
notes: Dataset published on HuggingFace.
4242

4343
weights_basemodel:
4444
class: open
@@ -67,9 +67,9 @@ hardware_architecture:
6767
notes:
6868

6969
preprint:
70-
class: closed
71-
link:
72-
notes:
70+
class: partial
71+
link: https://arxiv.org/pdf/2311.16840
72+
notes: Preprint released for training dataset, however not for end model.
7373

7474
paper:
7575
class: closed
@@ -82,9 +82,9 @@ modelcard:
8282
notes:
8383

8484
datasheet:
85-
class: closed
86-
link:
87-
notes:
85+
class: partial
86+
link: https://huggingface.co/datasets/OpenLLM-France/Claire-Dialogue-French-0.1
87+
notes: French language dataset reasonably documented.
8888

8989
# access:
9090
package:
@@ -101,5 +101,4 @@ api:
101101
licenses:
102102
class: partial
103103
link:
104-
notes: "cc-by-nc-sa-4.0"
105-
104+
notes: "cc-by-nc-sa-4.0"

CodeGeeX.yaml

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -68,13 +68,13 @@ hardware_architecture:
6868

6969
preprint:
7070
class: open
71-
link: https://arxiv.org/pdf/2406.12793
72-
notes:
71+
link: https://arxiv.org/abs/2303.17568v2
72+
notes: Preprint published on arXiv.
7373

7474
paper:
7575
class: closed
76-
link:
77-
notes:
76+
link: https://dl.acm.org/doi/abs/10.1145/3580305.3599790
77+
notes: Paper published in ACM.
7878

7979
modelcard:
8080
class: open

CodeGen.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -68,8 +68,8 @@ hardware_architecture:
6868

6969
preprint:
7070
class: open
71-
link: https://huggingface.co/papers/2305.02309
72-
notes:
71+
link: https://arxiv.org/pdf/2305.02309
72+
notes: paper published on arXiv.
7373

7474
paper:
7575
class: closed

CodeLlama.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -73,8 +73,8 @@ preprint:
7373

7474
paper:
7575
class: closed
76-
link:
77-
notes: No peer-reviewed paper found.
76+
link: https://conf.researchr.org/details/icse-2024/llm4code-2024-papers/2/Code-Llama-Open-Foundation-Models-for-Code
77+
notes: No peer-reviewed paper found. Preprint presented as keynote in ICSE24.
7878

7979
modelcard:
8080
class: partial

CogView.yaml

Lines changed: 7 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -15,7 +15,7 @@
1515

1616
system:
1717
name: CogView
18-
link: https://huggingface.co/THUDM/CogView4-6B
18+
link: https://huggingface.co/zai-org/CogView4-6B
1919
type: image
2020
performanceclass: full
2121
basemodelname: CogView4-6B
@@ -26,7 +26,7 @@ system:
2626

2727
org:
2828
name: Zhipu AI
29-
link: https://github.com/THUDM
29+
link: https://github.com/zai-org
3030
notes: "Zhipu AI, one of China's AI tigers."
3131

3232
# availability:
@@ -42,17 +42,17 @@ datasources_endmodel:
4242

4343
weights_basemodel:
4444
class: open
45-
link: https://huggingface.co/THUDM/CogView4-6B
45+
link: https://huggingface.co/zai-org/CogView4-6B
4646
notes: "Weights made available on HuggingFace."
4747

4848
weights_endmodel:
4949
class: open
50-
link: https://huggingface.co/THUDM/CogView4-6B
50+
link: https://huggingface.co/zai-org/CogView4-6B
5151
notes: "Weights made available on HuggingFace."
5252

5353
trainingcode:
5454
class: closed
55-
link: https://github.com/THUDM/CogView4
55+
link: https://github.com/zai-org/CogView4
5656
notes: Only inference code made available on GitHub.
5757

5858
# documentation:
@@ -78,7 +78,7 @@ paper:
7878

7979
modelcard:
8080
class: partial
81-
link: https://huggingface.co/THUDM/CogView4-6B
81+
link: https://huggingface.co/zai-org/CogView4-6B
8282
notes: "Model card only contains information about inference and licensing."
8383

8484
datasheet:
@@ -100,5 +100,5 @@ api:
100100

101101
licenses:
102102
class: open
103-
link: https://huggingface.co/THUDM/CogView4-6B#license
103+
link: https://huggingface.co/zai-org/CogView4-6B#license
104104
notes: "Apache-2.0, an OSI-approved license."

Cosmos.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -67,9 +67,9 @@ hardware_architecture:
6767
notes:
6868

6969
preprint:
70-
class: closed
70+
class: open
7171
link: https://arxiv.org/abs/2501.03575
72-
notes:
72+
notes: preprint published on arXiv.
7373

7474
paper:
7575
class: closed

DeepFloyd.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -74,7 +74,7 @@ preprint:
7474

7575
paper:
7676
class: closed
77-
link:
77+
link: https://dl.acm.org/doi/10.5555/3600270.3602913
7878
notes: No paper found.
7979

8080
modelcard:

DeepHermes.yaml

Lines changed: 31 additions & 31 deletions
Original file line numberDiff line numberDiff line change
@@ -15,14 +15,14 @@
1515

1616
system:
1717
name: DeepHermes
18-
link: https://huggingface.co/NousResearch/DeepHermes-3-Llama-3-8B-Preview
18+
link: https://huggingface.co/NousResearch/Hermes-4-70B
1919
type: text
2020
performanceclass: full
21-
basemodelname: Llama-3.1-8B
22-
endmodelname: DeepHermes-3-Llama-3-8B-Preview
21+
basemodelname: Llama-3.1-70B
22+
endmodelname: Hermes-4-70B
2323
endmodellicense: Llama 3 Community License Agreement
24-
releasedate: 2025-02
25-
notes: Evaluated model only available as preview.
24+
releasedate: 2025-08
25+
notes: Hybrid reasoning model based on Llama.
2626

2727
org:
2828
name: Nous Research
@@ -36,69 +36,69 @@ datasources_basemodel:
3636
notes: based on LLama3
3737

3838
datasources_endmodel:
39-
class: closed
40-
link:
41-
notes: mix of synthetic data
39+
class: partial
40+
link: https://huggingface.co/datasets/NousResearch/Hermes-3-Dataset
41+
notes: Mix of synthetic data. Data for previous model version made available.
4242

4343
weights_basemodel:
4444
class: partial
45-
link: https://huggingface.co/meta-llama/Meta-Llama-3.1-8B
45+
link: https://huggingface.co/meta-llama/Meta-Llama-3.1-70B
4646
notes: Inspecting the training weights requires signing Meta Llama 3.1's bespoke 'community license', not an OSI recognised open license
4747

4848
weights_endmodel:
4949
class: open
50-
link: https://huggingface.co/NousResearch/DeepHermes-3-Llama-3-8B-Preview
51-
notes:
50+
link: https://huggingface.co/NousResearch/Hermes-4-70B
51+
notes: Model made available through HuggingFace.
5252

5353
trainingcode:
54-
class: closed
55-
link:
56-
notes:
54+
class: full
55+
link: https://github.com/NousResearch/atropos
56+
notes: Training code published on GitHub.
5757

5858
# documentation:
5959
code:
60-
class: closed
61-
link:
62-
notes:
60+
class: open
61+
link: https://huggingface.co/datasets/NousResearch/Hermes-3-Dataset
62+
notes: Code comprehensively documented.
6363

6464
hardware_architecture:
65-
class: closed
66-
link:
67-
notes:
65+
class: open
66+
link: https://arxiv.org/pdf/2508.18255
67+
notes: Hardware setup comprehensively documented in paper.
6868

6969
preprint:
70-
class: closed
71-
link:
72-
notes:
70+
class: open
71+
link: https://arxiv.org/pdf/2508.18255
72+
notes: Preprint published on arXiv.
7373

7474
paper:
7575
class: closed
7676
link:
77-
notes:
77+
notes: No peer-reviewed paper found.
7878

7979
modelcard:
80-
class: open
81-
link: https://huggingface.co/NousResearch/DeepHermes-3-Llama-3-8B-Preview
82-
notes:
80+
class: partial
81+
link: https://huggingface.co/NousResearch/Hermes-4-70B
82+
notes: Model card primarily contains usage information.
8383

8484
datasheet:
8585
class: closed
8686
link:
87-
notes:
87+
notes: No datasheet found.
8888

8989
# access:
9090
package:
9191
class: closed
9292
link:
93-
notes:
93+
notes: No package found.
9494

9595
api:
9696
class: closed
9797
link:
98-
notes:
98+
notes: No API found.
9999
metaprompt: closed
100100

101101
licenses:
102102
class: closed
103-
link:
103+
link: https://huggingface.co/datasets/NousResearch/Hermes-3-Dataset
104104
notes: Meta custom license

FLUX.1.yaml

Lines changed: 7 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -15,11 +15,11 @@
1515

1616
system:
1717
name: FLUX.1
18-
link: https://huggingface.co/black-forest-labs/FLUX.1-dev
18+
link: https://huggingface.co/black-forest-labs/FLUX.1-Krea-dev
1919
type: image
2020
performanceclass: full
2121
basemodelname: FLUX.1-Dev
22-
endmodelname: FLUX.1-Dev
22+
endmodelname: FLUX.1-Krea-dev
2323
endmodellicense: FLUX.1 [dev] Non-Commercial License
2424
releasedate: 2024-07
2525
notes: Rectified flow image-generation model.
@@ -47,7 +47,7 @@ weights_basemodel:
4747

4848
weights_endmodel:
4949
class: partial
50-
link: https://huggingface.co/black-forest-labs/FLUX.1-dev
50+
link: https://huggingface.co/black-forest-labs/FLUX.1-Krea-dev
5151
notes: Gated model made available through HuggingFace.
5252

5353
trainingcode:
@@ -69,8 +69,8 @@ hardware_architecture:
6969

7070
preprint:
7171
class: closed
72-
link:
73-
notes: No preprint found.
72+
link: https://www.krea.ai/blog/flux-krea-open-source-release
73+
notes: No preprint found. Blog post released.
7474

7575
paper:
7676
class: closed
@@ -79,7 +79,7 @@ paper:
7979

8080
modelcard:
8181
class: partial
82-
link: https://huggingface.co/black-forest-labs/FLUX.1-dev
82+
link: https://huggingface.co/black-forest-labs/FLUX.1-Krea-dev
8383
notes: Model card primarily contains inference and license information.
8484

8585
datasheet:
@@ -101,5 +101,5 @@ api:
101101

102102
licenses:
103103
class: closed
104-
link: https://huggingface.co/black-forest-labs/FLUX.1-dev#license
104+
link: https://huggingface.co/black-forest-labs/FLUX.1-Krea-dev#license
105105
notes: FLUX.1 [dev] Non-Commercial License, not an OSI-approved license.

0 commit comments

Comments
 (0)