Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
jordan martinelli's picture
3 1 16

jordan martinelli

joSeb5777777777777777777
ยท

AI & ML interests

None yet

Recent Activity

reacted to salma-remyx's post with ๐Ÿš€ 6 days ago
Just trained a 2B coding model to rank candidate AI/ML research ideas against the implicit preferences in a code repository's merge history. The training data comes from a Gaussian Process fit on the accumulated dispositions in VQASynth, where each PR against a deployed project yields a pairwise comparison between the feature branch preferred and the baseline at main. The GP scores candidate papers to synthesize preference pairs, and DPO with LoRA bakes the ranking pipeline into the model's weights. After 1 epoch the model reaches 87.4% reward accuracy on the held-out eval split against 92.3% on training, consistent with learning the task without overfitting. Now, I'm scaling the pipeline to thousands of repos for a generalization test. Dataset: https://huggingface.co/datasets/remyxai/mhpd-dpo-v0 Model: https://huggingface.co/remyxai/mhpd-dpo-qwen3.5-2b-vqasynth Substack: https://remyxai.substack.com/p/the-ai-pm
liked a model 10 days ago
samuelcardillo/Qwen3-Coder-Next-Opus-4.6-Reasoning-Distilled-GGUF
new activity 15 days ago
huggingface/InferenceSupport:arcee-ai/Trinity-Large-Thinking-GGUF
View all activity

Organizations

None yet

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs