voidism commited on
Commit
0f02780
·
verified ·
1 Parent(s): 4d59d04

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +35 -34
README.md CHANGED
@@ -1,34 +1,35 @@
1
- ---
2
- license: llama3.1
3
- ---
4
-
5
- # SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models
6
-
7
-
8
- Paper: https://arxiv.org/abs/2502.09604
9
- Authors: [Yung-Sung Chuang](https://people.csail.mit.edu/yungsung/)†, [Benjamin Cohen-Wang](https://bencw99.github.io/)†, [Shannon Zejiang Shen](https://www.szj.io/)†, [Zhaofeng Wu](https://zhaofengwu.github.io/)†, [Hu Xu](https://howardhsu.github.io/)‡, [Xi Victoria Lin](https://victorialin.org/)‡, [James Glass](https://people.csail.mit.edu/jrg/)†, [Shang-Wen Li](https://swdanielli.github.io/)‡, [Wen-tau Yih](https://scottyih.org/)‡
10
- † Massachusetts Institute of Technology, ‡ Meta AI
11
-
12
- ![main-fig](https://github.com/voidism/SelfCite/raw/main/SelfCite.png)
13
-
14
- This model is a reproduction of the SimPO fine-tuned model from a Llama-3.1-8B-Instruct model, which was first trained with SFT data from ContextCite (256 calls). This is the fully self-supervised setting experiment in our paper.
15
- Please refer to our GitHub repository for usage and more details: https://github.com/voidism/SelfCite
16
-
17
- ## Citation
18
- Please cite our paper as well as LongCite if they are helpful to your work!
19
-
20
- ```bibtex
21
- @inproceedings{chuang2025selfcite,
22
- title={SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models},
23
- author={Yung-Sung Chuang and Benjamin Cohen-Wang and Shannon Zejiang Shen and Zhaofeng Wu and Hu Xu and Xi Victoria Lin and James Glass and Shang-Wen Li and Wen-tau Yih},
24
- journal={arXiv preprint arXiv:25xx.xxxxx},
25
- year={2025}
26
- }
27
-
28
- @article{zhang2024longcite,
29
- title = {LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA}
30
- author={Jiajie Zhang and Yushi Bai and Xin Lv and Wanjun Gu and Danqing Liu and Minhao Zou and Shulin Cao and Lei Hou and Yuxiao Dong and Ling Feng and Juanzi Li},
31
- journal={arXiv preprint arXiv:2409.02897},
32
- year={2024}
33
- }
34
- ```
 
 
1
+ ---
2
+ license: llama3.1
3
+ ---
4
+
5
+ # SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models
6
+
7
+
8
+ Paper: https://arxiv.org/abs/2502.09604
9
+ Authors: [Yung-Sung Chuang](https://people.csail.mit.edu/yungsung/)†, [Benjamin Cohen-Wang](https://bencw99.github.io/)†, [Shannon Zejiang Shen](https://www.szj.io/)†, [Zhaofeng Wu](https://zhaofengwu.github.io/)†, [Hu Xu](https://howardhsu.github.io/)‡, [Xi Victoria Lin](https://victorialin.org/)‡, [James Glass](https://people.csail.mit.edu/jrg/)†, [Shang-Wen Li](https://swdanielli.github.io/)‡, [Wen-tau Yih](https://scottyih.org/)‡
10
+ † Massachusetts Institute of Technology, ‡ Meta AI
11
+
12
+ ![main-fig](https://github.com/facebookresearch/SelfCite/raw/main/SelfCite.png)
13
+
14
+ This model is a reproduction of the SelfCite 8B SimPO fine-tuned model, trained by running the official training scripts from the [SelfCite repository](https://github.com/facebookresearch/SelfCite/tree/main/training).
15
+ The model is initialized from [Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct), and then first trained with SFT data from [ContextCite](https://github.com/MadryLab/context-cite) (256 calls). This is the fully self-supervised setting experiment in the [SelfCite paper](https://arxiv.org/abs/2502.09604).
16
+ Please refer to our GitHub repository for usage and more details: https://github.com/facebookresearch/SelfCite
17
+
18
+ ## Citation
19
+ Please cite our paper as well as LongCite if they are helpful to your work!
20
+
21
+ ```bibtex
22
+ @inproceedings{chuang2025selfcite,
23
+ title={SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models},
24
+ author={Yung-Sung Chuang and Benjamin Cohen-Wang and Shannon Zejiang Shen and Zhaofeng Wu and Hu Xu and Xi Victoria Lin and James Glass and Shang-Wen Li and Wen-tau Yih},
25
+ journal={arXiv preprint arXiv:2502.09604},
26
+ year={2025}
27
+ }
28
+
29
+ @article{zhang2024longcite,
30
+ title = {LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA}
31
+ author={Jiajie Zhang and Yushi Bai and Xin Lv and Wanjun Gu and Danqing Liu and Minhao Zou and Shulin Cao and Lei Hou and Yuxiao Dong and Ling Feng and Juanzi Li},
32
+ journal={arXiv preprint arXiv:2409.02897},
33
+ year={2024}
34
+ }
35
+ ```