Frankiey001 's Collections Fav-papers
updated
Human-like Episodic Memory for Infinite Context LLMs
Paper
• 2407.09450
• Published
• 62
MUSCLE: A Model Update Strategy for Compatible LLM Evolution
Paper
• 2407.09435
• Published
• 23
Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled
Refusal Training
Paper
• 2407.09121
• Published
• 6
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG
Capabilities
Paper
• 2407.14482
• Published
• 26
Internal Consistency and Self-Feedback in Large Language Models: A
Survey
Paper
• 2407.14507
• Published
• 46
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference
Paper
• 2407.14057
• Published
• 46
Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix"
Cycle
Paper
• 2407.13833
• Published
• 12
Very Large-Scale Multi-Agent Simulation in AgentScope
Paper
• 2407.17789
• Published
• 35
AppWorld: A Controllable World of Apps and People for Benchmarking
Interactive Coding Agents
Paper
• 2407.18901
• Published
• 35
OpenDevin: An Open Platform for AI Software Developers as Generalist
Agents
Paper
• 2407.16741
• Published
• 76
LAMBDA: A Large Model Based Data Agent
Paper
• 2407.17535
• Published
• 37
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Paper
• 2408.07055
• Published
• 68
Diversity Empowers Intelligence: Integrating Expertise of Software
Engineering Agents
Paper
• 2408.07060
• Published
• 41
TurboEdit: Instant text-based image editing
Paper
• 2408.08332
• Published
• 20
LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to
Small-Scale Local LLMs
Paper
• 2408.13467
• Published
• 25
Diffusion Models Are Real-Time Game Engines
Paper
• 2408.14837
• Published
• 126
LongCite: Enabling LLMs to Generate Fine-grained Citations in
Long-context QA
Paper
• 2409.02897
• Published
• 48
WildVis: Open Source Visualizer for Million-Scale Chat Logs in the Wild
Paper
• 2409.03753
• Published
• 19
Attention Heads of Large Language Models: A Survey
Paper
• 2409.03752
• Published
• 92
Note ...
From MOOC to MAIC: Reshaping Online Teaching and Learning through
LLM-driven Agents
Paper
• 2409.03512
• Published
• 29
A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor?
Paper
• 2409.15277
• Published
• 38
Hallucinating AI Hijacking Attack: Large Language Models and Malicious
Code Recommenders
Paper
• 2410.06462
• Published
• 7
A Flexible Large Language Models Guardrail Development Methodology
Applied to Off-Topic Prompt Detection
Paper
• 2411.12946
• Published
• 22
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions
Paper
• 2411.14405
• Published
• 61
OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented
LMs
Paper
• 2411.14199
• Published
• 34
Progressive Multimodal Reasoning via Active Retrieval
Paper
• 2412.14835
• Published
• 73
Byte Latent Transformer: Patches Scale Better Than Tokens
Paper
• 2412.09871
• Published
• 108
Cosmos World Foundation Model Platform for Physical AI
Paper
• 2501.03575
• Published
• 82
Agent Laboratory: Using LLM Agents as Research Assistants
Paper
• 2501.04227
• Published
• 95
On the Trustworthiness of Generative Foundation Models: Guideline,
Assessment, and Perspective
Paper
• 2502.14296
• Published
• 45
Llama-Nemotron: Efficient Reasoning Models
Paper
• 2505.00949
• Published
• 41
The Devil behind the mask: An emergent safety vulnerability of Diffusion
LLMs
Paper
• 2507.11097
• Published
• 64