arvinsingh committed
Commit 8d9d851 · verified · 1 parent: 85709fd

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +61 -0
README.md ADDED
@@ -0,0 +1,61 @@
+ ---
+ license: mit
+ tags:
+ - pytorch
+ - diffusion
+ - ddpm
+ - image-generation
+ - cifar10
+ datasets:
+ - cifar10
+ ---
+
+ # DDPM trained on CIFAR-10
+
+ A Denoising Diffusion Probabilistic Model (DDPM) trained on CIFAR-10 with distributed data-parallel training across 6 V100 GPUs.
+
+ ## Model Description
+
+ - **Architecture:** U-Net with attention (128 base channels)
+ - **Training:** 100 epochs on CIFAR-10 (50,000 training images)
+ - **Hardware:** 3 nodes × 2 V100 16 GB GPUs (6 GPUs total)
+ - **Framework:** PyTorch DistributedDataParallel (DDP)
+
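With 3 nodes of 2 GPUs each, DDP runs one process per GPU, for a world size of 6. The sketch below shows how those processes are usually numbered under `torchrun` (global rank = node rank × processes per node + local rank). It assumes that launcher was used, which the card does not state, and `global_rank` is a hypothetical helper rather than part of the repository.

```python
def global_rank(node_rank, local_rank, gpus_per_node=2):
    """Global rank of a process under the usual node-major numbering."""
    return node_rank * gpus_per_node + local_rank

num_nodes, gpus_per_node = 3, 2
world_size = num_nodes * gpus_per_node

# Map (node index, GPU index on that node) to the global DDP rank.
layout = {
    (node, gpu): global_rank(node, gpu, gpus_per_node)
    for node in range(num_nodes)
    for gpu in range(gpus_per_node)
}
print(world_size, layout)
```

Each rank holds a full model replica and processes its own shard of every batch, with gradients averaged across all ranks.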
+ ## Training Details
+
+ | Parameter | Value |
+ |-----------|-------|
+ | Batch Size | 64 per GPU (384 effective across 6 GPUs) |
+ | Learning Rate | 2e-4 |
+ | Diffusion Timesteps | 1000 |
+ | EMA Decay | 0.9999 |
+
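The effective batch size and the approximate number of optimizer steps follow from the table by simple arithmetic. The sketch below assumes a `DistributedSampler`-style even split of the 50,000 images across the 6 GPUs, with the last partial batch kept. The card states neither, so the step counts are estimates, and `training_steps` is a hypothetical helper.

```python
import math

def training_steps(num_images, world_size, per_gpu_batch, epochs):
    """Estimate optimizer steps under an assumed even per-rank split."""
    # Assumption: each rank sees ceil(num_images / world_size) samples per epoch
    # (the sampler pads so every rank gets the same count).
    samples_per_rank = math.ceil(num_images / world_size)
    # Assumption: the final partial batch is kept rather than dropped.
    steps_per_epoch = math.ceil(samples_per_rank / per_gpu_batch)
    return {
        "effective_batch": per_gpu_batch * world_size,
        "steps_per_epoch": steps_per_epoch,
        "total_steps": steps_per_epoch * epochs,
    }

stats = training_steps(num_images=50_000, world_size=6, per_gpu_batch=64, epochs=100)

# Rough averaging window of an EMA with decay d is 1 / (1 - d) steps.
ema_window = round(1 / (1 - 0.9999))
print(stats, ema_window)
```

Under these assumptions the run takes on the order of 13,000 steps, comparable to the roughly 10,000-step averaging window implied by an EMA decay of 0.9999.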
+ ## Usage
+
+ ```python
+ import torch
+ # UNet and GaussianDiffusion are assumed to come from the `models` module
+ # of the repository linked under "Training Code".
+ from models import UNet, GaussianDiffusion
+
+ model = UNet(in_channels=3, out_channels=3, base_channels=128,
+              channel_mults=(1, 2, 2, 2), num_res_blocks=2, attention_resolutions=(2,))
+ # map_location="cpu" lets the checkpoint load on machines without a GPU.
+ model.load_state_dict(torch.load("model_ema.pt", map_location="cpu"))
+ model.eval()
+
+ diffusion = GaussianDiffusion(timesteps=1000)
+ samples = diffusion.sample(model, image_size=32, batch_size=16, channels=3)
+ ```
+
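`GaussianDiffusion(timesteps=1000)` fixes the length of the noise schedule. The card does not say which schedule is used. As an illustration only, this sketch assumes the linear schedule from the original DDPM paper (beta from 1e-4 to 0.02) and computes the cumulative signal fraction in plain Python. `linear_alpha_bar` is a hypothetical helper, not part of the repository's API.

```python
def linear_alpha_bar(timesteps=1000, beta_start=1e-4, beta_end=0.02):
    """Return cumulative products of (1 - beta_t) for an assumed linear schedule."""
    alpha_bar, running = [], 1.0
    for t in range(timesteps):
        # Linearly interpolate beta between the two endpoints.
        beta = beta_start + (beta_end - beta_start) * t / (timesteps - 1)
        running *= 1.0 - beta
        alpha_bar.append(running)
    return alpha_bar

alpha_bar = linear_alpha_bar()

# Forward process: x_t = sqrt(alpha_bar[t]) * x_0 + sqrt(1 - alpha_bar[t]) * noise
print(f"signal kept at t=0: {alpha_bar[0]:.4f}, at t=999: {alpha_bar[-1]:.2e}")
```

By the final timestep almost none of the original image remains, which is why sampling can start from pure Gaussian noise and denoise step by step.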
+ ## Training Code
+
+ [GitHub Repository](https://github.com/arvinsingh/Darkbird)
+
+ ## Citation
+
+ ```bibtex
+ @misc{darkbird2026,
+   author = {Arvin Singh},
+   title = {Darkbird: Distributed Training Examples},
+   year = {2026},
+   url = {https://github.com/arvinsingh/Darkbird}
+ }
+ ```