1 252 643

Gurumurthi V Ramanan

GVR

https://surasys.co

AI & ML interests

Recent Activity

liked a model 1 day ago

LiquidAI/LFM2.5-Audio-1.5B

liked a model 3 days ago

Ex0bit/MiniMax-M2.1-PRISM

upvoted an article 4 days ago

Illustrating Reinforcement Learning from Human Feedback (RLHF)

View all activity

Organizations

upvoted an article 4 days ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022

•

389

upvoted 2 papers 6 days ago

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Paper • 2512.24618 • Published 8 days ago • 117

Is There a Better Source Distribution than Gaussian? Exploring Source Distributions for Image Flow Matching

Paper • 2512.18184 • Published 19 days ago • 20

upvoted a paper 9 days ago

VL-JEPA: Joint Embedding Predictive Architecture for Vision-language

Paper • 2512.10942 • Published 27 days ago • 37

upvoted a paper 10 days ago

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2512.20848 • Published 15 days ago • 32

upvoted 3 articles 12 days ago

Article

Efficient MultiModal Data Pipeline

Jul 8, 2025

•

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

May 21, 2025

•

247

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Sep 11, 2025

•

177

upvoted an article 17 days ago

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

21 days ago

•

105

upvoted 2 papers 24 days ago

A Survey of Vibe Coding with Large Language Models

Paper • 2510.12399 • Published Oct 14, 2025 • 49

Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning

Paper • 2512.10534 • Published 27 days ago • 31

upvoted a paper 25 days ago

One Layer Is Enough: Adapting Pretrained Visual Encoders for Image Generation

Paper • 2512.07829 • Published about 1 month ago • 21

upvoted an article 25 days ago

Article

MiniGuard-v0.1: Prem's Guardrail Model Redefining the Pareto Frontier

26 days ago

•

upvoted 2 papers 26 days ago

CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning

Paper • 2511.18659 • Published Nov 24, 2025 • 19

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 283

upvoted a collection 26 days ago

rnj-1

Collection

5 items • Updated 19 days ago • 39

upvoted a paper 26 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 96

upvoted a paper 27 days ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published about 1 month ago • 75

upvoted an article 27 days ago

Article

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

29 days ago

•

upvoted an article about 1 month ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025

•

565

Gurumurthi V Ramanan

AI & ML interests

Recent Activity

Organizations

GVR's activity

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Efficient MultiModal Data Pipeline

nanoVLM: The simplest repository to train your VLM in pure PyTorch

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

MiniGuard-v0.1: Prem's Guardrail Model Redefining the Pareto Frontier

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

We Got Claude to Fine-Tune an Open Source LLM