Wissam Antoun's picture

Wissam Antoun

wissamantoun

·

https://wissamantoun.github.io/

AI & ML interests

LLMs. Robustness to Noise, Tokenization, AI text detection, Arabic NLP, MLOPS

Recent Activity

liked a dataset about 7 hours ago

sarulab-speech/yodas2_sidon

liked a dataset 6 days ago

arbml/CLEANANERCorp

updated a model 6 days ago

almanach/Gaperon-8B-ckpts

View all activity

Organizations

upvoted a collection 10 days ago

Sparse Auto-Encoders (SAEs) for Mechanistic Interpretability

A compilation of sparse auto-encoders trained on large language models. • 34 items • Updated Oct 10 • 4

upvoted a paper about 1 month ago

Gaperon: A Peppered English-French Generative Language Model Suite

Paper • 2510.25771 • Published Oct 29 • 15

upvoted a paper about 1 year ago

CamemBERT 2.0: A Smarter French Language Model Aged to Perfection

Paper • 2411.08868 • Published Nov 13, 2024 • 13

upvoted a collection over 1 year ago

Awesome Document AI

A collection of open-source document AI 📄 📝 📈 • 27 items • Updated Mar 11, 2024 • 80

upvoted 2 papers over 1 year ago

Harvesting Textual and Structured Data from the HAL Publication Repository

Paper • 2407.20595 • Published Jul 30, 2024 • 22

mOSCAR: A Large-scale Multilingual and Multimodal Document-level Corpus

Paper • 2406.08707 • Published Jun 13, 2024 • 17