llm-stuff

A collection of findings, documents, and code snippets about LLMs

- Structured output with vLLM + Guidance
- Structured output vLLM
- vLLM + Langchain
- Early stopping the generation in HF Transformers