kennethleungty/Domain-LLMs

Domain Specific LLMs

Comprehensive Compilation of Customized LLMs for Specific Domains and Industries

Image: landscape (generated by Midjourney v5)

Context

  • Large language models (LLMs) have reshaped natural language processing, performing strongly across a wide range of tasks, from simple text generation to complex problem-solving.
  • As their capabilities grow, so does the demand to tailor these models to specific domains and industries, so that their broad knowledge is attuned to specialized requirements.
  • This repo aims to build a database of domain-specific LLMs optimized for different sectors, ranging from healthcare and legal to finance and entertainment.
  • It seeks to bridge the gap between generic LLMs and niche applications by showcasing tools that cater to the linguistic nuances and knowledge demands of each industry.

Contributions Welcome!


Contents

  1. Biology
  2. Finance
  3. Healthcare
  4. Information Technology

Biology

| Name | Type | Description | Demo | Paper | Repo | Site |
|------|------|-------------|------|-------|------|------|
| ProtGPT2 | Pre-trained | LLM (738 million parameters) for protein engineering and design, trained on the protein space to generate de novo protein sequences that follow the principles of natural ones. | 🔗 | - | - | 🔗 |

Finance

| Name | Type | Description | Demo | Paper | Repo | Site |
|------|------|-------------|------|-------|------|------|
| BloombergGPT | Pre-trained | 50-billion-parameter LLM trained on a wide range of financial data (363-billion-token dataset) | - | 🔗 | - | - |
| FinChat | ? | Generative AI tool for investment research that reduces the time needed for data aggregation, visualization, and summaries | 🔗 | - | - | 🔗 |
| FinGPT | Fine-tuned | Series of LLMs fine-tuned from base models (e.g., Llama-2) on open finance data | - | 🔗 | 🔗 | 🔗 |
| FinMA | Fine-tuned | Financial LLM obtained by fine-tuning LLaMA on finance-based instruction data (136K samples) | 🔗 | 🔗 | 🔗 | - |
| Ask FT | ? | LLM tool that lets users ask any question and receive a response based on Financial Times (FT) content published over the last two decades | 🔗 | 🔗 | - | - |

Healthcare

| Name | Type | Description | Repo | Paper | Demo | Site |
|------|------|-------------|------|-------|------|------|
| Med-PaLM | Fine-tuned | Google's LLM (fine-tuned with PaLM as the base model), designed to provide high-quality answers to medical questions | - | 🔗 | - | 🔗 |
| Med-PaLM 2 | Fine-tuned | Enhanced version of Med-PaLM, released by Google in March 2023 with improved performance | 🔗 | 🔗 | 🔗 | 🔗 |
| PharmacyGPT | In-context Learning | GPT-4 model coupled with in-context learning (a dynamic prompting approach) using domain-specific data | - | 🔗 | - | - |
| RUSSELL-GPT | Fine-tuned | LLM developed by the National University Health System in Singapore to enhance clinicians' productivity (e.g., medical Q&A, case note summarization) | - | - | - | 🔗 |

Information Technology (IT)

| Name | Type | Description | Repo | Paper | Demo | Site |
|------|------|-------------|------|-------|------|------|
| OWL | Fine-tuned | A large language model for IT operations, fine-tuned on a custom Owl-Instruct dataset covering a wide range of IT-related information | - | 🔗 | - | - |


To Do

  • Include examples from the range of other domains/industries listed in Contributing
  • Include non-LLM GenAI examples (expand scope of repo)