Log inSign up
Devendra Chaplot
682 posts
user avatar
Devendra Chaplot
@dchaplot
Building superintelligence @xai
SF Bay Area
devendrachaplot.github.io
Joined April 2010
504
Following
66.6K
Followers
  • Pinned
    user avatar
    Devendra Chaplot
    @dchaplot
    Mar 13
    I'm joining SpaceX and xAI, working closely with Elon and team to build superintelligence. Together SpaceX and xAI combine physical and digital intelligence under a leader who understands hardware at the deepest level. Add a high-agency culture with frontier-scale resources, and
    44M
  • user avatar
    Devendra Chaplot
    @dchaplot
    Jan 9, 2024
    We just released Mixtral 8x7B paper on Arxiv: arxiv.org/abs/2401.04088
    436K
  • user avatar
    Devendra Chaplot
    @dchaplot
    Oct 10, 2024
    We just released Pixtral 12B paper on Arxiv: arxiv.org/abs/2410.07073
    90K
  • user avatar
    Devendra Chaplot
    @dchaplot
    Dec 11, 2023
    Proud to announce: Mixtral 8x7B -- Mixtral of Experts - Free to use under Apache 2.0 license - outperforms Llama 2 70B with 6x faster inference. - matches or outperforms GPT3.5 - masters English, French, Italian, German and Spanish. - seq_len = 32K mistral.ai/news/mixtral-o… 1/N
    223K
  • user avatar
    Devendra Chaplot
    @dchaplot
    Feb 18, 2025
    Career Update: Incredibly fortunate and excited to be part of the founding team at Thinking Machines Lab! thinkingmachines.ai Join us: 6wajk07p.paperform.co
    95K
  • user avatar
    Devendra Chaplot
    @dchaplot
    Nov 18, 2024
    Today, we are announcing two new exciting updates: Pixtral Large: Frontier-class 124B multimodal model, powering the new Le Chat. Brand new Le Chat: With web search, canvas, image-gen, image understanding & more- all for free! 1/3
    292K
  • user avatar
    Devendra Chaplot
    @dchaplot
    Jul 24, 2024
    Super excited to announce Mistral Large 2 - 123B params - fits on a single H100 node - Natively Multilingual - Strong code & reasoning - SOTA function calling - Open-weights for non-commercial usage Blog: mistral.ai/news/mistral-l… Weights: huggingface.co/mistralai/Mist… 1/N
    304K
  • user avatar
    Devendra Chaplot
    @dchaplot
    Apr 17, 2024
    We just released Mixtral-8x22B-v0.1 and Mixtral-8x22B-Instruct-v0.1: - Free to use under Apache 2.0 license - Outperforms all open models - Native function calling - Masters English, French, Italian, German and Spanish. - Seq_len = 64K mistral.ai/news/mixtral-8…
    152K
  • user avatar
    Devendra Chaplot
    @dchaplot
    Oct 1, 2025
    Announcing our first product: Tinker! Tinker is a training API for everyone! It lets you focus on what matters in LLM training - your data and algorithms - while we handle the heavy lifting of distributed training. You can train your own models using Tinker even if you have no
    user avatar
    Thinking Machines
    @thinkymachines
    Oct 1, 2025
    Introducing Tinker: a flexible API for fine-tuning language models. Write training loops in Python on your laptop; we'll run them on distributed GPUs. Private beta starts today. We can't wait to see what researchers and developers build with cutting-edge open models!
    Tinker
    Tinker
    From thinkingmachines.ai
    165K
  • user avatar
    Devendra Chaplot
    @dchaplot
    May 24, 2024
    We just released mistral-finetune, the official repo and guide on how to fine-tune Mistral open-source models using LoRA: github.com/mistralai/mist… Also released Mistral-7B-Instruct-v0.3 with support for function calling with Apache 2.0 license: models.mistralcdn.com/mistral-7b-v0-…
    GitHub - mistralai/mistral-finetune
    From github.com
    68K
  • user avatar
    Devendra Chaplot
    @dchaplot
    Feb 13, 2025
    Career update: After an incredible journey at Mistral AI, I made the hard decision to leave and pursue another exciting opportunity. Will share more details very soon! Very proud of the Mistral team and their accomplishments, I wish them continued success!
    65K
  • user avatar
    Devendra Chaplot
    @dchaplot
    Feb 26, 2024
    Proud to announce: Mistral Large - 81.2% MMLU - outperforms Claude 2, Gemini 1.0 Pro, GPT-3.5 - multilingual with seq_len = 32K - json mode and function calling Also excited to launch Le Chat (Try it now at chat.mistral.ai) 🔗 mistral.ai/news/mistral-l…
    60K
  • user avatar
    Devendra Chaplot
    @dchaplot
    Oct 11, 2023
    Mistral 7B paper is on Arxiv: arxiv.org/abs/2310.06825
    75K
  • user avatar
    Devendra Chaplot
    @dchaplot
    Oct 28, 2025
    Today we’re excited to add gpt-oss and DeepSeek model families to Tinker - one of our top community requests. With Tinker, you can train a 671B parameter model on your laptop in just a few lines of code. No GPU rentals. No CUDA. No cluster setup. Just train.
    user avatar
    Thinking Machines
    @thinkymachines
    Oct 28, 2025
    We just added 4 new models to Tinker from the gpt-oss and DeepSeek-V3.1 families. Sign up for the waitlist: thinkingmachines.ai/tinker/
    110K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms of Service|Privacy Policy|Cookie Policy|Accessibility|Ads info|© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
✕

Wait! Don't Go Yet 🚀

Get our FREE eBook "10 Programming Tips That Changed Everything" when you subscribe!

No spam. Unsubscribe anytime.