A curated list of awesome AI resources—including papers, datasets, projects, and more—organized by field.
- Computer Vision
- Natural Language Processing (NLP)
- Transformers and LLMs
- Reinforcement Learning
- Papers
- Datasets
- Projects
- Books
- Conferences and Journals
- Blogs and Websites
- Tools and Platforms
- Contributing
- License
- OpenCV - Open Source Computer Vision Library.
- TensorFlow - An end-to-end open-source platform for machine learning.
- PyTorch - An open-source machine learning library.
- Keras - Deep learning API written in Python.
- Caffe - A deep learning framework made with expression, speed, and modularity in mind.
- MXNet - A flexible and efficient library for deep learning.
- Detectron2 - Facebook AI Research's next-generation software system for object detection and segmentation.
- Albumentations - Fast image augmentation library.
- SimpleCV - An open-source framework for building computer vision applications.
- MMDetection - Open-source object detection toolbox based on PyTorch.
- YOLO (You Only Look Once) - Real-time object detection system.
- Mask R-CNN - Implementation of Mask R-CNN on Python and Keras.
- DeepLab - Semantic image segmentation with deep learning.
- OpenPose - Real-time multi-person keypoint detection library.
- StyleGAN - Official TensorFlow implementation of the StyleGAN generator.
- CycleGAN - Unpaired image-to-image translation.
- DeepDream - Neural network art project.
- FaceNet - A unified embedding for face recognition and clustering.
- DensePose - Dense human pose estimation in the wild.
- DALL·E - Creating images from text descriptions.
- CS231n: Convolutional Neural Networks for Visual Recognition - Stanford course on CNNs for visual recognition.
- Fast.ai Computer Vision Course - Practical deep learning for coders.
- Deep Learning Specialization by Andrew Ng - Covers convolutional networks and more.
- Udacity's Intro to Computer Vision - Introduction to computer vision.
- PyImageSearch - Tutorials on computer vision and OpenCV.
- NLTK - Natural Language Toolkit.
- spaCy - Industrial-strength NLP in Python.
- Gensim - Topic modeling for humans.
- Stanford CoreNLP - A suite of core NLP tools.
- OpenNMT - Open-source toolkit for neural machine translation.
- AllenNLP - An open-source NLP research library.
- Flair - A simple framework for state-of-the-art NLP.
- TextBlob - Simplified text processing.
- Polyglot - Multilingual text processing toolkit.
- Stanza - A collection of accurate and efficient tools for the linguistic analysis of many human languages.
- BERT - Pre-training of Deep Bidirectional Transformers for Language Understanding.
- GPT-3 - OpenAI's Generative Pre-trained Transformer 3.
- ELMo - Deep contextualized word representations.
- Word2Vec - Efficient estimation of word representations in vector space.
- fastText - Library for efficient learning of word representations and sentence classification.
- Universal Sentence Encoder - Encoding sentences into embedding vectors.
- Transformer-XL - Attentive language models beyond a fixed-length context.
- OpenAI Codex - An AI system that translates natural language to code.
- ULMFiT - Universal Language Model Fine-tuning.
- CS224n: Natural Language Processing with Deep Learning - Stanford course on NLP with deep learning.
- Natural Language Processing Specialization by deeplearning.ai - Coursera specialization on NLP.
- fast.ai NLP Course - Practical deep learning for coders focused on NLP.
- Oxford Deep NLP 2017 - Oxford's deep learning for NLP course.
- Hugging Face Tutorials - Tutorials on Transformers library.
- Hugging Face Transformers - State-of-the-art NLP for PyTorch and TensorFlow 2.0.
- Megatron-LM - Training multi-billion parameter language models using model parallelism.
- Fairseq - Facebook AI Research sequence-to-sequence toolkit.
- DeepSpeed - High-performance deep learning optimization library.
- Mesh TensorFlow - Model parallelism for large models.
- GPT-Neo - Open-source implementation of GPT-3-like models.
- SentenceTransformers - Multilingual sentence embeddings using BERT.
- OpenAI Triton - Open-source GPU programming language.
- GPT-4 - OpenAI's latest language model.
- T5: Text-to-Text Transfer Transformer - Exploring a text-to-text approach to NLP tasks.
- XLNet - Generalized autoregressive pretraining for language understanding.
- RoBERTa - A Robustly Optimized BERT Pretraining Approach.
- ALBERT - A Lite BERT for self-supervised learning of language representations.
- OPT (Open Pre-trained Transformer) - Facebook's open-source language model.
- PaLM - Google's Pathways Language Model.
- The Illustrated Transformer - Visual guide to understanding the Transformer model.
- Transformers from Scratch - A tutorial on building Transformers.
- Hugging Face Course - Learn how to use Transformers and tokenizers.
- Stanford CS25: Transformers United - Advanced topics in Transformers.
- OpenAI Gym - A toolkit for developing and comparing reinforcement learning algorithms.
- Stable Baselines3 - Set of reliable implementations of reinforcement learning algorithms.
- RLlib - Scalable Reinforcement Learning library.
- TensorForce - A TensorFlow library for applied reinforcement learning.
- Keras-RL - Deep reinforcement learning with Keras.
- DeepMind's AlphaGo - Mastering the game of Go.
- OpenAI Five - Dota 2 AI bots.
- DQN (Deep Q-Network) - Human-level control through deep reinforcement learning.
- AlphaZero - Mastering games without human data.
- MuZero - Mastering games without knowing the rules.
- CS285: Deep Reinforcement Learning - UC Berkeley course on deep RL.
- Deep Reinforcement Learning Course by DeepMind - Comprehensive RL course.
- Spinning Up in Deep RL - Educational resource by OpenAI.
- Reinforcement Learning Specialization - University of Alberta.
- Attention Is All You Need - Introduced the Transformer model.
- Deep Residual Learning for Image Recognition - Introducing ResNets.
- Generative Adversarial Networks - The original GAN paper.
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
- RLHF: Training Language Models to Follow Instructions with Human Feedback
- DALL·E: Zero-Shot Text-to-Image Generation
- AlphaGo Zero: Mastering the game of Go without human knowledge
- Neural Machine Translation by Jointly Learning to Align and Translate - Introduction of the attention mechanism.
- The Lottery Ticket Hypothesis - Finding sparse, trainable neural networks.
- Adam: A Method for Stochastic Optimization - Introducing the Adam optimizer.
- khmer text data - Khmer unicode text data for unsupervised learning language model
-
ImageNet - Large visual database for use in visual object recognition research.
-
COCO Dataset - Large-scale object detection, segmentation, and captioning dataset.
-
Common Crawl - An open repository of web crawl data.
-
GLUE Benchmark - General Language Understanding Evaluation.
-
MNIST Dataset - Handwritten digit database.
-
SQuAD - Stanford Question Answering Dataset.
-
LibriSpeech - Large corpus of read English speech.
-
CIFAR-10 and CIFAR-100 - Image classification datasets.
-
WikiText Language Modeling Dataset - Large language modeling dataset.
-
KITTI Vision Benchmark Suite - For autonomous driving research.
-
face recognition datasets - Collections of face data
-
VGGFace2 - a large-scale face recognition dataset. Images are downloaded from Google Image Search and have large variations in pose, age, illumination, ethnicity and profession.
-
VGGFace2 Torrent - VGGFace2 dataset download via torrent
-
wav2vec2-khmer-openslr - Wav2Vec2 with OpenSLR 42 (Khmer language)
-
Fast Khmer Dictionary - Fast khmer dictionary
-
tha - 📢 Tha (ថា) - A Khmer Text Normalization and Verbalization Toolkit
-
khmer line breaker - The lightweight Khmer line breaker with no dependencies. (10.4 kB)
-
khmer2number - A Khmer word to number converter.
-
sosab - 🗣️ sosap(សូរសព្ទ) Python binding for Phonetisaurus
-
khmerpronounce - Khmer Pronounciation Toolkit
-
khmersegment - A Khmer word segmentation tool built for NIPTICT (now CADT) Khmer Word Segmentation CRF model.
-
kfa - A fast Khmer Forced Aligner powered by Wav2Vec2CTC and Phonetisaurus
-
khmercut - A (fast) Khmer word segmentation toolkit.
-
pykhmernlp - Collections of Khmer language toolkit.
-
autocrop_kh - Automatic Document Segmentation and Cropping for Khmer IDs, Passport and Documents
-
ocr_toolkits - Collection of functions to work with ocr and synthetic data generator
-
sdab - Khmer Speech to Text Inference with Pretrained Model
- fast.ai - Simplifying AI and making it accessible.
- DeepMind Lab - A 3D learning environment for agent-based AI research.
- Keras - Deep learning for humans.
- PyTorch - An open-source machine learning library.
- TensorFlow Models - Models and examples built with TensorFlow.
- AllenNLP - An open-source NLP research library.
- Caffe - A deep learning framework made with expression, speed, and modularity in mind.
- Horovod - Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
- OpenAI Gym - A toolkit for developing and comparing reinforcement learning algorithms.
- OpenAI Baselines - High-quality implementations of reinforcement learning algorithms.
- CLIP (Contrastive Language–Image Pretraining) - Connects text and images.
- DALL·E - Creating images from text descriptions.
- OpenAI Codex - An AI system that translates natural language to code.
- OpenAI Microscope - Visualization of neurons in neural networks.
- fish speech - Brand new TTS solution
- vits - Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
- vits2 - Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
- Donut - Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
- deep text recognition - Text recognition (optical character recognition) with deep learning methods, ICCV 2019
- craft - Official implementation of Character Region Awareness for Text Detection (CRAFT)
- EasyOCR - Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
- SAM2 - Segment Anything Model 2 (SAM 2) is a foundation model towards solving promptable visual segmentation in images and videos.
- Mask_RCNN - Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
- Mask_RCNN Pytorch - Mask R-CNN for object detection and instance segmentation on Pytorch
- insightface - State-of-the-art 2D and 3D Face Analysis Project
- InsightFace Pytorch - Pytorch0.4.1 codes for InsightFace
- facenet - Face Recoginition using Tensorflow
- facenet pytorch - Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models
- facenet pytorch glint360k - A PyTorch implementation of the 'FaceNet' paper for training a facial recognition model with Triplet Loss using the glint360k dataset. A pre-trained model using Triplet Loss is available for download.
- openface - Face recognition with deep neural networks.
- face_recognition - The world's simplest facial recognition api for Python and the command line
- DeepFace - A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
- Retinaface - RetinaFace: Deep Face Detection Library for Python
- litgpt - 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
- litdata - Transform datasets at scale. Optimize datasets for fast AI model training.
- dify - Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
- LLaMA-Factory - Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
- Deep Learning - By Ian Goodfellow, Yoshua Bengio, and Aaron Courville.
- Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow - By Aurélien Géron.
- Natural Language Processing with PyTorch - By Delip Rao and Brian McMahan.
- Pattern Recognition and Machine Learning - By Christopher M. Bishop.
- Reinforcement Learning: An Introduction - By Richard S. Sutton and Andrew G. Barto.
- Probabilistic Programming & Bayesian Methods for Hackers - By Cameron Davidson-Pilon.
- Grokking Deep Learning - By Andrew W. Trask.
- The Elements of Statistical Learning - By Trevor Hastie, Robert Tibshirani, and Jerome Friedman.
- NeurIPS - Conference on Neural Information Processing Systems.
- ICML - International Conference on Machine Learning.
- CVPR - Conference on Computer Vision and Pattern Recognition.
- ACL - Association for Computational Linguistics.
- Journal of Machine Learning Research
- International Conference on Learning Representations (ICLR)
- AAAI Conference on Artificial Intelligence
- KDD Conference - Knowledge Discovery and Data Mining.
- Distill - Interactive, visual explanations of machine learning.
- AI Alignment Forum - Discussions on AI alignment.
- Andrej Karpathy's Blog - Articles on deep learning.
- OpenAI Blog - Latest news and research from OpenAI.
- The Batch by deeplearning.ai - Weekly newsletter on AI.
- DeepMind Blog - Research and news from DeepMind.
- Machine Learning Mastery - Tutorials on machine learning.
- Analytics Vidhya - Data science community.
- Weights & Biases - Experiment tracking, model management, and dataset versioning.
- TensorBoard - Visualization toolkit for TensorFlow.
- Docker - Container platform for building and deploying applications.
- Kaggle - Platform for data science competitions.
- Google Colab - Free Jupyter notebook environment.
- Azure Machine Learning - Cloud platform for machine learning.
- Amazon SageMaker - Build, train, and deploy ML models at scale.
- MLflow - Open-source platform for the machine learning lifecycle.
Contributions are welcome! Please submit a pull request or open an issue to add new resources.