Hubble - Architecture

updated Oct 15, 2025

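Models trained with two alternative transformer architectures, one shallower (half_depth) and one deeper (double_depth), to assess how model depth affects memorization. Each architecture is listed below in a standard and a perturbed variant, with separate -hf and -neox checkpoint repositories.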

  • allegrolab/hubble-1b-100b_toks-double_depth-perturbed-hf

    Text Generation • 1B • Updated Oct 23, 2025 • 6

  • allegrolab/hubble-1b-100b_toks-double_depth-standard-hf

    Text Generation • 1B • Updated Oct 23, 2025 • 4

  • allegrolab/hubble-1b-100b_toks-half_depth-perturbed-hf

    Text Generation • 1B • Updated Oct 23, 2025 • 4

  • allegrolab/hubble-1b-100b_toks-half_depth-standard-hf

    Text Generation • 1B • Updated Oct 23, 2025 • 6

  • allegrolab/hubble-1b-100b_toks-double_depth-perturbed-neox

    Text Generation • Updated Oct 23, 2025

  • allegrolab/hubble-1b-100b_toks-double_depth-standard-neox

    Text Generation • Updated Oct 23, 2025

  • allegrolab/hubble-1b-100b_toks-half_depth-perturbed-neox

    Text Generation • Updated Oct 23, 2025

  • allegrolab/hubble-1b-100b_toks-half_depth-standard-neox

    Text Generation • Updated Oct 23, 2025

  • allegrolab/dclm-baseline-500b_toks

    Updated Oct 23, 2025 • 343

    Note: Use revision 'perturbed-100b' for the perturbed models and 'standard' otherwise.
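
The revision rule in the note can be sketched in Python as follows. This is a minimal illustration, not the authors' tooling: the helper names `revision_for` and `load_training_corpus` are hypothetical, and the loader assumes that `allegrolab/dclm-baseline-500b_toks` is a Hub dataset repository readable by the `datasets` library, which the listing does not confirm.

```python
# Revision names come from the collection note; everything else is illustrative.
DATASET_REPO = "allegrolab/dclm-baseline-500b_toks"


def revision_for(model_repo: str) -> str:
    """Return the corpus revision that matches a model repository name."""
    # Split on "-" so that "perturbed" must appear as a whole name component.
    name = model_repo.split("/")[-1]
    return "perturbed-100b" if "perturbed" in name.split("-") else "standard"


def load_training_corpus(model_repo: str):
    """Hypothetical loader; assumes the repo is a dataset usable with `datasets`."""
    # Imported lazily so the helper above works without the library installed.
    from datasets import load_dataset

    # streaming=True avoids downloading the full corpus up front.
    return load_dataset(
        DATASET_REPO,
        revision=revision_for(model_repo),
        streaming=True,
    )
```

Calling `revision_for` on any repository name from the list above selects the revision the note prescribes. `load_training_corpus` needs network access and may need adjusting if the repository is laid out differently than assumed here.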
