AI & ML interests

transformation

Recent Activity

zoeywin  updated a model 17 days ago
ValiantLabs/Qwen3.6-27B-Esper3.1
sequelbox  updated a collection 18 days ago
Esper 3.1

sequelbox 
posted an update 4 days ago
JUST RELEASED: the Tachibana 4 DeepSeek-V4-Pro dataset and our all-new Tachibana-Agent coding model!

- Questions prioritize real-world, challenging agentic coding tasks across a variety of programming languages and topics. Synthetic prompts utilize a variety of personas, experience levels, and styles of communication to maximize real-world flexibility and usability.
- Areas of focus include back-end and front-end development, systems programming, distributed systems, performance optimization, data structures, databases and data engineering, game and mobile development, security engineering, compiler design, custom tooling, task automation, practical bugfixes, and more!
- A wide variety of emphasized languages improves development capability: Python, C, C++, C#, Go, TypeScript, Java, JavaScript, Rust, Haskell, SQL, Shell, R, Ruby, assembly code, and more!

The new dataset: sequelbox/Tachibana4-DeepSeek-V4-Pro
The new model: sequelbox/Qwen3.6-27B-Tachibana-Agent

We're thrilled to bring this to everyone - try it out and see what you think!

Tachibana 4 is the first of several datasets used for the upcoming Esper 4! See what we're working on and help our releases come out faster: sequelbox/SupportOpenSource

Open source will win :)

love,
allegra
sequelbox 
posted an update 10 days ago
EARLY SNEAK PREVIEW of our first DeepSeek-V4-Pro dataset, Tachibana 4!

Tachibana 4 is our upcoming agentic coding dataset:
- Questions prioritize real-world, challenging agentic coding tasks across a variety of programming languages and topics.
- Areas of focus include back-end and front-end development, systems programming, distributed systems, performance optimization, data structures, databases and data engineering, game and mobile development, security engineering, compiler design, custom tooling, task automation, practical bugfixes, and more!
- A wide variety of emphasized languages improves development capability: Python, C, C++, C#, Go, TypeScript, Java, JavaScript, Rust, Haskell, SQL, Shell, R, Ruby, assembly code, and more!
- Synthetic prompts utilize a variety of personas, experience levels, and styles of communication to maximize real-world flexibility and usability.

Get it now: sequelbox/Tachibana4-DeepSeek-V4-Pro-PREVIEW

These agentic datasets will power the upcoming Esper 4, and whatever you can build! We'll have more finetunes on the way as well! :) we're going to make open source better and better for your work!

If you would like to see Esper 4 and these datasets faster, this is the best way you can help us: sequelbox/SupportOpenSource

for freedom, with love,
allegra
sequelbox 
posted an update 21 days ago
NEW RELEASE: Esper 3.1 for Qwen 3.6!

- Your dedicated DevOps expert: Esper 3.1 maximizes DevOps and architecture helpfulness, powered by high-difficulty DevOps and architecture data generated with DeepSeek-V3.1-Terminus!
- Improved coding performance: challenging code-reasoning datasets stretch DeepSeek-V3.1-Terminus and DeepSeek-V3.2 to the limits, allowing Esper 3.1 to tackle harder coding tasks!
- AI to build AI: our high-difficulty AI expertise data boosts Esper 3.1's MLOps, AI architecture, AI research, and general reasoning skills.

Get it now: ValiantLabs/Qwen3.6-35B-A3B-Esper3.1

We're working on more finetunes for the newest Qwen and Gemma models, and we've also started working on the agentic-first datasets for Esper 4 :) we're going to make open source better and better for your work!

Please note that real-life financial and family concerns have popped up and imposed unfortunate limitations on our ability to devote time to our open-source work :( If you would like to see Esper 4 and our other releases speed up instead of slowing down, this is the best way you can help us: sequelbox/SupportOpenSource

No matter what, we'll keep fighting and we won't give up!

with love,
allegra
sequelbox 
posted an update 27 days ago
Multiple new releases for Gemma 4!

For Gemma 4 31B: Guardpoint, our medical reasoning model, trained on medical knowledge, management, diagnosis, and tasks:
- Structured medical reasoning responses are efficient and informative, cutting token costs for faster inference!
- Wide-ranging knowledge base: trained on a wide variety of medical disciplines, patient types, and query structures!
- High quality medical responses emphasize performance, brevity, specificity, statistical rationality, and openness.

Get Guardpoint for Gemma 4: ValiantLabs/gemma-4-31B-it-Guardpoint

For Gemma 4 E4B and E2B: Shining Valiant 3, our science-reasoning model!
- Science-reasoning: physics, biology, chemistry, compsci, astronomy, Earth science, and information theory.
- AI to build AI: high-quality reasoning performance on AI, MLOps, math and CUDA, complex adaptive and agentic systems, cognition, logic, linguistics, simulation, knowledge management, and more!
- Supplemented creative reasoning and general chat performance.

Get the new SV3 models:
E4B: ValiantLabs/gemma-4-E4B-it-ShiningValiant3
E2B: ValiantLabs/gemma-4-E2B-it-ShiningValiant3

We're working on several things - most excitingly, we've officially started the dataset curation process for Esper 4! We're focused on enhanced agentic capability and higher-difficulty, higher-value tasks this time, and we're very excited to bring this to everyone when we can :)

Help support our releases! Donations go toward our experimental models and datasets: sequelbox/SupportOpenSource

Fight for open source with us!

for love and friendship,
allegra