Skip to content

Commit a42b785

Browse files
committed
Merge remote-tracking branch 'origin/preview'
2 parents 9cfa29e + 26b4968 commit a42b785

23 files changed

+136
-90
lines changed

.github/workflows/draft-pdf.yml

Lines changed: 24 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,24 @@
1+
name: Draft PDF
2+
on: [push]
3+
4+
jobs:
5+
paper:
6+
runs-on: ubuntu-latest
7+
name: Paper Draft
8+
steps:
9+
- name: Checkout
10+
uses: actions/checkout@v4
11+
- name: Build draft PDF
12+
uses: openjournals/openjournals-draft-action@master
13+
with:
14+
journal: joss
15+
# This should be the path to the paper within your repo.
16+
paper-path: paper.md
17+
- name: Upload
18+
uses: actions/upload-artifact@v4
19+
with:
20+
name: paper
21+
# This is the output path where Pandoc will write the compiled
22+
# PDF. Note, this should be the same directory as the input
23+
# paper.md
24+
path: paper.pdf

BitNet.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@
1010
# Liesenfeld, A. and Dingemanse, M., 2024. Rethinking open source generative AI: open-washing and the EU AI Act. In Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency (pp. 1774-1787).
1111

1212
system:
13-
name: BitNet b1.58 2B4T
13+
name: BitNet
1414
link: https://huggingface.co/microsoft/bitnet-b1.58-2B-4T
1515
type: text
1616
performanceclass: full

DeepFloyd.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -26,7 +26,6 @@ org:
2626
notes: Collaboration between various organizations.
2727

2828
# availability:
29-
3029
datasources_basemodel:
3130
class: partial
3231
link: https://huggingface.co/DeepFloyd/IF-I-XL-v1.0#training

FLUX.1.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -26,7 +26,6 @@ org:
2626
notes: Image-generation model start-up.
2727

2828
# availability:
29-
3029
datasources_basemodel:
3130
class: closed
3231
link:

Falcon.yaml

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -64,9 +64,7 @@ hardware_architecture:
6464

6565
preprint:
6666
class: open
67-
link:
68-
- https://arxiv.org/abs/2306.01116
69-
- https://arxiv.org/abs/2311.16867
67+
link: ["https://arxiv.org/abs/2306.01116", "https://arxiv.org/abs/2311.16867"]
7068
notes: First preprint covers the creation and curation of RefinedWeb dataset, but not other aspects of the model. The second preprint provides more details about the model architecture, implementation, evaluation results, and limitations.
7169

7270
paper:

GLM.yaml

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -28,9 +28,7 @@ org:
2828
# availability:
2929
datasources_basemodel:
3030
class: closed
31-
link:
32-
- http://doi.org/10.18653/v1/2022.acl-long.26
33-
- https://arxiv.org/abs/2406.12793
31+
link: ["http://doi.org/10.18653/v1/2022.acl-long.26", "https://arxiv.org/abs/2406.12793"]
3432
notes: Training data not centrally made available, but described in 2022 ACL paper, appears to be mostly public datasets. Preprint also mentions "Our pre-training corpus consists of multilingual (mostly English and Chinese) documents from a mixture of different sources, including webpages, Wikipedia, books, code, and research papers", but does not go into more detail.
3533

3634
datasources_endmodel:

Poro.yaml

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@
1010
# Liesenfeld, A. and Dingemanse, M., 2024. Rethinking open source generative AI: open-washing and the EU AI Act. In Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency (pp. 1774-1787).
1111

1212
system:
13-
name: Poro-34B
13+
name: Poro
1414
link: https://huggingface.co/LumiOpen/Poro-34B
1515
type: text
1616
performanceclass: full
@@ -26,11 +26,6 @@ org:
2626
notes: Silo AI was acquired by AMD in August 2024
2727

2828
# availability:
29-
trainingcode:
30-
class: open
31-
link: https://github.com/LumiOpen/Megatron-DeepSpeed
32-
notes: Custom fork of the Megatron-Deepspeed framework used for training Poro-34B.
33-
3429
datasources_basemodel:
3530
class: partial
3631
link: https://arxiv.org/html/2404.01856v1
@@ -51,6 +46,11 @@ weights_endmodel:
5146
link: https://huggingface.co/LumiOpen/Poro-34B
5247
notes: Final model weights released under Apache 2.0 license.
5348

49+
trainingcode:
50+
class: open
51+
link: https://github.com/LumiOpen/Megatron-DeepSpeed
52+
notes: Custom fork of the Megatron-Deepspeed framework used for training Poro-34B.
53+
5454
# documentation:
5555
code:
5656
class: open

Qwen.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@
1010
# Liesenfeld, A. and Dingemanse, M., 2024. Rethinking open source generative AI: open-washing and the EU AI Act. In Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency (pp. 1774-1787).
1111

1212
system:
13-
name: Qwen3-235B-A22B
13+
name: Qwen
1414
link: https://huggingface.co/Qwen/Qwen3-235B-A22B
1515
type: text
1616
performanceclass: latest

SDXL-Lightning.yaml

Lines changed: 4 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -26,11 +26,6 @@ org:
2626
notes: Chinese technology company that owns Tiktok
2727

2828
# availability:
29-
trainingcode:
30-
class: closed
31-
link:
32-
notes: The training code for SDXL-Lightning has not been publicly released.
33-
3429
datasources_basemodel:
3530
class: partial
3631
link: https://arxiv.org/pdf/2307.01952
@@ -51,6 +46,10 @@ weights_endmodel:
5146
link: https://huggingface.co/ByteDance/SDXL-Lightning
5247
notes: Available through HuggingFace in the form of SafeTensors
5348

49+
trainingcode:
50+
class: closed
51+
link:
52+
notes: The training code for SDXL-Lightning has not been publicly released.
5453

5554
# documentation:
5655
code:

Teuken.yaml

Lines changed: 6 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -26,11 +26,6 @@ org:
2626
notes: Project aiming to develop LLMs in Germany.
2727

2828
# availability:
29-
trainingcode:
30-
class: closed
31-
link:
32-
notes:
33-
3429
datasources_basemodel:
3530
class: partial
3631
link: https://arxiv.org/pdf/2410.08800
@@ -51,6 +46,11 @@ weights_endmodel:
5146
link: https://huggingface.co/openGPT-X/Teuken-7B-instruct-commercial-v0.4
5247
notes: Available via Huggingface repository.
5348

49+
trainingcode:
50+
class: closed
51+
link:
52+
notes:
53+
5454
# documentation:
5555
code:
5656
class: closed
@@ -64,10 +64,7 @@ hardware_architecture:
6464

6565
preprint:
6666
class: open
67-
link:
68-
- https://arxiv.org/abs/2410.03730
69-
- https://arxiv.org/abs/2410.08928
70-
- https://arxiv.org/abs/2410.08800
67+
link: ["https://arxiv.org/abs/2410.03730", "https://arxiv.org/abs/2410.08928", "https://arxiv.org/abs/2410.08800"]
7168
notes: Three corresponding preprints, detailing the models, data, and evaluation.
7269

7370
paper:

0 commit comments

Comments
 (0)