💡 Tips: Devendra Chaplot (@dchaplot) / X

Devendra Chaplot

682 posts

Devendra Chaplot

@dchaplot

Building superintelligence @xai

SF Bay Area

devendrachaplot.github.io

Joined April 2010

Pinned
Devendra Chaplot
@dchaplot
Mar 13
I'm joining SpaceX and xAI, working closely with Elon and team to build superintelligence. Together SpaceX and xAI combine physical and digital intelligence under a leader who understands hardware at the deepest level. Add a high-agency culture with frontier-scale resources, and
44M
Devendra Chaplot
@dchaplot
Jan 9, 2024
We just released Mixtral 8x7B paper on Arxiv: arxiv.org/abs/2401.04088
436K
Devendra Chaplot
@dchaplot
Oct 10, 2024
We just released Pixtral 12B paper on Arxiv: arxiv.org/abs/2410.07073
90K
Devendra Chaplot
@dchaplot
Dec 11, 2023
Proud to announce: Mixtral 8x7B -- Mixtral of Experts - Free to use under Apache 2.0 license - outperforms Llama 2 70B with 6x faster inference. - matches or outperforms GPT3.5 - masters English, French, Italian, German and Spanish. - seq_len = 32K mistral.ai/news/mixtral-o… 1/N
223K
Devendra Chaplot
@dchaplot
Feb 18, 2025
Career Update: Incredibly fortunate and excited to be part of the founding team at Thinking Machines Lab! thinkingmachines.ai Join us: 6wajk07p.paperform.co
95K
Devendra Chaplot
@dchaplot
Nov 18, 2024
Today, we are announcing two new exciting updates: Pixtral Large: Frontier-class 124B multimodal model, powering the new Le Chat. Brand new Le Chat: With web search, canvas, image-gen, image understanding & more- all for free! 1/3
292K
Devendra Chaplot
@dchaplot
Jul 24, 2024
Super excited to announce Mistral Large 2 - 123B params - fits on a single H100 node - Natively Multilingual - Strong code & reasoning - SOTA function calling - Open-weights for non-commercial usage Blog: mistral.ai/news/mistral-l… Weights: huggingface.co/mistralai/Mist… 1/N
304K
Devendra Chaplot
@dchaplot
Apr 17, 2024
We just released Mixtral-8x22B-v0.1 and Mixtral-8x22B-Instruct-v0.1: - Free to use under Apache 2.0 license - Outperforms all open models - Native function calling - Masters English, French, Italian, German and Spanish. - Seq_len = 64K mistral.ai/news/mixtral-8…
152K
Devendra Chaplot
@dchaplot
Oct 1, 2025
Announcing our first product: Tinker! Tinker is a training API for everyone! It lets you focus on what matters in LLM training - your data and algorithms - while we handle the heavy lifting of distributed training. You can train your own models using Tinker even if you have no
Thinking Machines
@thinkymachines
Oct 1, 2025
Introducing Tinker: a flexible API for fine-tuning language models. Write training loops in Python on your laptop; we'll run them on distributed GPUs. Private beta starts today. We can't wait to see what researchers and developers build with cutting-edge open models!
Tinker
From thinkingmachines.ai
165K
Devendra Chaplot
@dchaplot
May 24, 2024
We just released mistral-finetune, the official repo and guide on how to fine-tune Mistral open-source models using LoRA: github.com/mistralai/mist… Also released Mistral-7B-Instruct-v0.3 with support for function calling with Apache 2.0 license: models.mistralcdn.com/mistral-7b-v0-…
GitHub - mistralai/mistral-finetune
From github.com
68K
Devendra Chaplot
@dchaplot
Feb 13, 2025
Career update: After an incredible journey at Mistral AI, I made the hard decision to leave and pursue another exciting opportunity. Will share more details very soon! Very proud of the Mistral team and their accomplishments, I wish them continued success!
65K
Devendra Chaplot
@dchaplot
Feb 26, 2024
Proud to announce: Mistral Large - 81.2% MMLU - outperforms Claude 2, Gemini 1.0 Pro, GPT-3.5 - multilingual with seq_len = 32K - json mode and function calling Also excited to launch Le Chat (Try it now at chat.mistral.ai) 🔗 mistral.ai/news/mistral-l…
60K
Devendra Chaplot
@dchaplot
Oct 11, 2023
Mistral 7B paper is on Arxiv: arxiv.org/abs/2310.06825
75K
Devendra Chaplot
@dchaplot
Oct 28, 2025
Today we’re excited to add gpt-oss and DeepSeek model families to Tinker - one of our top community requests. With Tinker, you can train a 671B parameter model on your laptop in just a few lines of code. No GPU rentals. No CUDA. No cluster setup. Just train.
Thinking Machines
@thinkymachines
Oct 28, 2025
We just added 4 new models to Tinker from the gpt-oss and DeepSeek-V3.1 families. Sign up for the waitlist: thinkingmachines.ai/tinker/
110K