Advanced Deep Learning
This section covers cutting-edge deep learning research topics with mathematical rigor and practical implementations.
Prerequisites:
- Complete 06-neural-networks/
- Advanced mathematics (03-maths/advanced/)
- Understanding of PyTorch/TensorFlow
Audience: Researchers, PhD students, advanced ML practitioners
This is one of the deepest theory-heavy sections in the repo. It is not intended for a first pass through Zero to AI. Come here after you already have strong fundamentals and a clear reason to study one of these research areas.
📚 Table of Contents
Part I: Advanced Generative Models
1. Generative Adversarial Networks (GANs)
- 01_gan_mathematics.ipynb - Traditional GAN theory, game theory perspective
- 02_wgan_theory.ipynb - Wasserstein GAN, optimal transport
- 03_infogan.ipynb - Information-theoretic regularization
- 04_bayesian_gan.ipynb - Bayesian approach to GANs
2. Variational Autoencoders & Extensions
- 05_vae_deep_dive.ipynb - VAE theory, ELBO derivation
- 06_importance_weighted_vae.ipynb - IWAE, tighter bounds
- 07_normalizing_flows.ipynb - Flow-based models, invertible networks
- 08_adversarial_vae.ipynb - Combining VAE and GAN
3. Modern Generative Models
- 09_flow_matching.ipynb - Continuous normalizing flows
- 10_diffusion_models.ipynb - Denoising diffusion, score matching
- 11_mixture_models.ipynb - Mixture Density Networks, Stick-Breaking VAE
Part II: Optimization & Training
4. Variance Reduction Techniques
- 12_rebar_algorithm.ipynb - REBAR for discrete variables
- 13_relax_algorithm.ipynb - RELAX improvements
- 14_gumbel_max_trick.ipynb - Reparameterization for discrete distributions
- 15_gradient_estimators.ipynb - Survey of gradient estimation methods
5. Advanced Optimization
- 16_gradient_descent_research.ipynb - Implicit bias, implicit regularization
- 17_second_order_methods.ipynb - Natural gradient, K-FAC
- 18_adaptive_learning_rates.ipynb - Adam variants, lookahead optimizers
Part III: Model Understanding
6. Neural Network Theory
- 19_neural_tangent_kernel.ipynb - NTK theory, infinite-width limits
- 20_neural_ode.ipynb - Continuous depth networks
- 21_adjoint_methods.ipynb - Memory-efficient backpropagation
7. Attention & Transformers
- 22_attention_variants.ipynb - Linear attention, efficient transformers
- 23_sparse_attention.ipynb - Longformer, BigBird patterns
- 24_rotary_embeddings.ipynb - RoPE, ALiBi positional encodings
- 25_moe_transformers.ipynb - Mixture of Experts architectures
Part IV: Advanced Applications
8. 3D Computer Vision
- 26_camera_models.ipynb - Intrinsic/extrinsic parameters
- 27_epipolar_geometry.ipynb - Fundamental matrix, essential matrix
- 28_3d_reconstruction.ipynb - Structure from motion
- 29_depth_estimation.ipynb - Monocular depth prediction
- 30_3d_pose_estimation.ipynb - Multi-person, multi-view pose
9. Advanced NLP
- 31_transformer_deep_dive.ipynb - Advanced transformer architectures
- 32_efficient_transformers.ipynb - Linformer, Performer
- 33_multimodal_transformers.ipynb - Vision-language models
Part V: Special Topics
10. Probabilistic Deep Learning
- 34_bayesian_neural_nets.ipynb - Uncertainty quantification
- 35_neural_processes.ipynb - Meta-learning for functions
- 36_gaussian_processes_nn.ipynb - GP connections to deep learning
11. Modern Architectures
- 37_capsule_networks.ipynb - Dynamic routing, capsule theory
- 38_graph_neural_networks.ipynb - GCN, GAT, message passing
- 39_neural_architecture_search.ipynb - AutoML, DARTS
🎯 Learning Paths
Path 1: Generative Modeling Expert
01-04 GANs → 05-08 VAEs → 09-11 Modern Generative → 34-36 ProbabilisticPath 2: Optimization Researcher
16-18 Advanced Optimization → 12-15 Variance Reduction → 19-21 TheoryPath 3: Computer Vision Specialist
26-30 3D Computer Vision → 01-04 GANs → 10 Diffusion ModelsPath 4: Transformer/NLP Expert
22-25 Advanced Attention → 31-33 Advanced NLP → 37-39 Modern ArchitecturesPath 5: Complete Research Track
Work through all notebooks sequentially
How To Use This Section Well
- Pick one research path first instead of trying to complete all 39 notebooks in order.
- Read the linked papers selectively and implement one idea deeply.
- Use this phase to deepen a specialization, not to replace the practical core path.
📖 Key Research Papers
Generative Models
- GANs: “Generative Adversarial Networks” (Goodfellow et al., 2014)
- W-GAN: “Wasserstein GAN” (Arjovsky et al., 2017)
- VAE: “Auto-Encoding Variational Bayes” (Kingma & Welling, 2013)
- Diffusion: “Denoising Diffusion Probabilistic Models” (Ho et al., 2020)
Optimization
- REBAR: “REBAR: Low-variance gradient estimates” (Tucker et al., 2017)
- NTK: “Neural Tangent Kernel” (Jacot et al., 2018)
- Neural ODE: “Neural Ordinary Differential Equations” (Chen et al., 2018)
Transformers
- Original: “Attention Is All You Need” (Vaswani et al., 2017)
- RoPE: “RoFormer: Enhanced Transformer with Rotary Position Embedding” (Su et al., 2021)
- Efficient: “Linformer”, “Performer”, “Longformer” (2020)
3D Vision
- Structure from Motion: Hartley & Zisserman, “Multiple View Geometry”
- Depth estimation: Recent survey papers
Full references in individual notebooks.
🚀 Quick Start
# Install dependencies
pip install torch torchvision matplotlib numpy scipy
pip install transformers diffusers # For modern models
# Optional: 3D vision libraries
pip install opencv-python open3d
# Start with GAN mathematics
jupyter notebook 01_gan_mathematics.ipynb💻 Code Implementation
Each notebook includes:
From Scratch
- ✅ Pure NumPy/PyTorch implementations
- 📐 Mathematical derivations
- 🔬 Step-by-step explanations
Production Ready
- 🚀 Using modern libraries (Hugging Face, etc.)
- ⚡ Optimized implementations
- 🏭 Best practices
Visualizations
- 📊 Training dynamics
- 🎨 Generated samples
- 📈 Metrics and comparisons
🎓 Connection to Course
This section extends:
| Foundation | Advanced Extension |
|---|---|
| 06-neural-networks/05_transformer | 22-25 Advanced Attention |
| 13-multimodal/ | 33 Multimodal Transformers |
| 12-llm-finetuning/ | 16-18 Advanced Optimization |
| 08-rag/ | 19-21 Neural Network Theory |
📊 Practical Projects
Apply what you learn:
- GAN Art Generator: Train W-GAN on art datasets
- VAE for Molecules: Generate novel molecular structures
- 3D Scene Reconstruction: Build SfM pipeline
- Efficient Transformer: Implement linear attention
- Neural ODE Classifier: Continuous depth networks
Project templates included in notebooks.
🔬 Research Implementation Notes
Reproducibility
- Exact hyperparameters from papers
- Random seeds for reproducibility
- Multiple runs with error bars
Computational Requirements
- 🟢 CPU-friendly: Theory, small demos
- 🟡 GPU recommended: Most models
- 🔴 Multi-GPU: Large-scale training
Hardware requirements noted in each notebook.
🤝 Based on Research From
- Prof. Yida Xu - Machine learning research notes
- DeeCamp Lectures - Advanced deep learning seminars
- Recent Publications - 2018-2024 research papers
- Industry Practices - Production-grade implementations
⚠️ Difficulty Level
Research/Graduate Level 🔴🔴🔴🔴
Prerequisites:
- ✅ Strong mathematics (calculus, linear algebra, probability)
- ✅ Deep learning fundamentals
- ✅ PyTorch proficiency
- ✅ Research paper reading experience
- ✅ Graduate-level theoretical understanding
📬 Questions & Contributions
- Issues: Report bugs or ask questions
- Discussions: Theoretical discussions, paper recommendations
- PRs: Contribute new implementations or improvements
Tags: advanced-dl, research, generative-models, optimization
🎯 Learning Objectives
After completing this section, you will:
- ✅ Understand cutting-edge generative model theory
- ✅ Implement advanced optimization techniques
- ✅ Master variance reduction for discrete variables
- ✅ Build efficient transformer architectures
- ✅ Apply deep learning to 3D computer vision
- ✅ Understand theoretical foundations (NTK, Neural ODE)
- ✅ Read and implement recent research papers
- ✅ Contribute to ML research
What Comes Next
- Continue to ../28-practical-data-science/README.md if you want to reconnect theory to applied work.
- Continue to ../29-ai-hardware-llm-validation/README.md if your interest shifts toward systems performance and deployment constraints.
- Revisit ../12-llm-finetuning/README.md or ../13-multimodal/README.md if you want to apply advanced concepts in more product-oriented settings.
From Research to Reality 🚀🔬
“Research is what I’m doing when I don’t know what I’m doing.” - Wernher von Braun
Let’s discover together! 🌟