AI Papers
Foundational Concepts & Architectures:
- Backpropagation
- Convolutional Neural Networks
- Word2Vec Papers by Tomáš Mikolov.
- Sequence to Sequence Learning with Neural Networks
- Attention is All You Need
- ResNet, “Deep Residual Learning for Image Recognition”
Large Language Models & Related Techniques:
- Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
- Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
- GPT-4 Technical Report
- Claude 3 Model Card
- LLaMA 3 Paper
Specific Models & Applications:
- DeepSeek V3 & V2
- Whisper, Robust Speech Recognition via Large-Scale Weak Supervision
- Latent Diffusion Models (Stable Diffusion) Paper
- DALL-E 3 Scaling Text-to-Image Generation
Benchmarks & Evaluations:
- SWE-Bench Can Language Models Resolve Real-World GitHub Issues?
Curated Lists:
- NeurIPS Test of Time Papers
- Ilya’s Top 30 AI Papers A curated list by Ilya Sutskever, available at https://aman.ai/primers/ai/top-30-papers/.