Resources
- The Little Book of Deep Learning - François Fleuret
- François Fleuret’s deep learning course
- Jeremy Howard’s courses
- Karpathy’s LLM Tokenizer Video
- @arpitingle’s ML Resources Stash
- Deep Learning (MIT Press Book)
- this book covers everything up to just before transformers
- LocalModelsPapers rentry link
- Generative AI Handbook
Reading List
- LLM Reading List
- Unlimiformer - kNN inside transformers
- How far can you train a model on a single consume GPU
- Karpathy’s course on building NNs from scratch
- Reinforcement Learning (intro)
- A Meticulous Guide to Advances in Deep Learning Efficiency over the Years
links below from yacine.ca
- https://arxiv.org/abs/2209.04836
- Git re-basin
- https://arxiv.org/pdf/2212.04089.pdf
- editing models with task arithmetic
- Ologs (ontology log)
- GLaM
- DEMix Layers
- Branch-Train-Merge
- Cluster-BTM
- SMEAR
- TIES-Merging
- AdapterFusion
- KNN Zero-Shot Inference
- Cross-Task Skills with Task-Level Mixture-of-Experts
- Mixture-of-Supernets
- Sparse Upcycling
- AdaMix
- Prompting Large Language Models with Speech Recognition Abilities
- https://arxiv.org/pdf/2307.11795.pdf
- speech recognition with palm-e like strategy
https://dreamtuner-diffusion.github.io/ - ipadapter but different https://arxiv.org/pdf/2312.13789.pdf - how i beat the big wigs https://arxiv.org/abs/2312.09608 - faster stable diffusion by skipping unecessary bits https://github.com/damo-vilab/AnyDoor - instruct edit + https://old.reddit.com/r/StableDiffusion/comments/18kd0na/code_for_anydoor_zeroshot_objectlevel_image/ dont forget - diffusion slider demo https://github.com/Kevin-thu/DiffMorpher?tab=readme-ov-file https://arxiv.org/pdf/2312.01943.pdf - anime https://huggingface.co/spaces/JingyeChen22/TextDiffuser-2 https://github.com/open-mmlab/PIA - video generator https://github.com/cumulo-autumn/StreamDiffusion - turbo go fast