- Lecture slides
- Video (in Russian): lecture, practice
- Practice session: practice.ipynb
Applications mentioned in the lecture:
- the Gandalf game by Lakera.ai
- aidungeon.com - an LLM-generated role-playing game
- ora.ai - a tool to build your own chatbot with prompting
- this is not an exhaustive list: there are millions of applications using LLMs
Open-source models mentioned in the lecture:
- LLaMA-2 - https://huggingface.co/meta-llama/Llama-2-70b
- Falcon - https://huggingface.co/tiiuae/falcon-180B
- BLOOM - https://huggingface.co/bigscience/bloom
Some of these models require you to apply for access, and the model authors may take time to process your application. While you wait, you may wish to browse the Hugging Face hub for alternative (e.g. quantized) versions of the same model that are available immediately, with no application. For example, here is a LLaMA-2-70B quantized to 4-bit that can be loaded with the same transformers.AutoModelForCausalLM.from_pretrained syntax. Please note that, while the hub lets you download and use such model versions without officially applying for access to LLaMA-2, the original model's license may still restrict this kind of usage.
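Below is a minimal sketch of what that looks like. The repo id in the snippet is a hypothetical placeholder: substitute whichever quantized variant you actually find on the hub. Depending on the quantization format, you may also need `accelerate` plus a backend such as `auto-gptq` or `bitsandbytes` installed.

```python
# Minimal sketch: loading a community-quantized checkpoint with the usual
# from_pretrained syntax. The repo id below is a placeholder, not a real repo;
# replace it with a quantized variant you found on the Hugging Face hub.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "some-community-org/Llama-2-70B-4bit"  # placeholder repo id

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    device_map="auto",  # let accelerate spread layers across available GPUs / CPU
)

inputs = tokenizer("The capital of France is", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```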
Extra materials:
- Glitch tokens (the lecture mentions SolidGoldMagikarp) - blog post by Jessica Rumbelow and mwatkins
- "Sparks of AGI" - a controversial but influential paper about worrying LLM abilities
- BIG-bench - a benchmark of emergent LLM abilities mentioned in the slides
- Chain-of-thought papers: few-shot CoT prompting - Wei et al. (2022); a minimal prompt sketch follows at the end of this list
- A guide to prompt injection and jailbreaking: https://learnprompting.org/docs/prompt_hacking/injection
- A repo with popular jailbreaks for GPTx models: https://github.com/0xk1h0/ChatGPT_DAN
- A ton of other cool stuff linked in the lecture slides (see the top of this readme)
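To make the chain-of-thought item above concrete, here is a minimal few-shot CoT prompt sketch in the spirit of Wei et al. (2022); the demonstration is the classic tennis-ball example from that paper. The model name is only a small placeholder so the snippet runs anywhere - in practice you need a much larger model for the step-by-step reasoning effect to actually show up.

```python
# Minimal sketch of few-shot chain-of-thought prompting (in the spirit of Wei et al., 2022).
# The model below is a small placeholder; swap in a larger causal LM from the hub.
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2")  # placeholder model for illustration

prompt = (
    "Q: Roger has 5 tennis balls. He buys 2 cans of 3 tennis balls each. "
    "How many tennis balls does he have now?\n"
    "A: Roger started with 5 balls. 2 cans of 3 balls is 6 balls. 5 + 6 = 11. The answer is 11.\n\n"
    "Q: The cafeteria had 23 apples. They used 20 and bought 6 more. How many apples do they have?\n"
    "A:"
)

# The worked demonstration shows the model *how* to reason step by step before answering.
print(generator(prompt, max_new_tokens=60, do_sample=False)[0]["generated_text"])
```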