Show HN: Text-to-video model from scratch (2 brothers, 2 years, 2B params)
🟠HackerNews by schopra909 ▲ 62 💬 12
technical
View Original Post ↗ No analysis available for this story.
This story was indexed before article generation was enabled.
🤖 Classification Details
Detailed technical writeup of training text-to-video models from scratch. Includes architecture details (T5, VAE, DiT-variant), honest assessment of capabilities/limitations, and open-source release.