Added OLMo support to builder.py #1061

shobrienDMA · 2024-11-12T14:38:53Z

No description provided.

shobrienDMA · 2024-11-25T09:32:13Z

@microsoft-github-policy-service agree company="AMD"

BowenBao · 2024-12-10T17:16:38Z

@kunal-vaishnavi ptal, thanks!

kunal-vaishnavi · 2024-12-10T22:06:49Z

Thanks for the contribution! Does OLMo run end-to-end with the ONNX Runtime GenAI tokenizer?

Can you also update the following places?

- Add OLMo to the repo README and the model builder README to show that OLMo is now supported
- Add OLMo to the CIs

onnxruntime-genai/test/python/_test_utils.py

Lines 55 to 77 in 0f59a90

    
    def get_model_paths():
        hf_paths = {
            "phi-2": "microsoft/phi-2",
            # "phi-3-mini": "microsoft/Phi-3-mini-128k-instruct",
        }

        ci_data_path = os.path.join("/", "data", "ortgenai_pytorch_models")
        if not os.path.exists(ci_data_path):
            return {}, hf_paths

        # Note: If a model has over 4B parameters, please add a quantized version
        # to `ci_paths` instead of `hf_paths` to reduce file size and testing time.
        ci_paths = {
            "llama-2": os.path.join(ci_data_path, "Llama-2-7B-Chat-GPTQ"),
            "llama-3": os.path.join(ci_data_path, "Meta-Llama-3-8B-AWQ"),
            "mistral-v0.2": os.path.join(ci_data_path, "Mistral-7B-Instruct-v0.2-GPTQ"),
            # "phi-2": os.path.join(ci_data_path, "phi2"),
            # "gemma-2b": os.path.join(ci_data_path, "gemma-1.1-2b-it"),
            "gemma-7b": os.path.join(ci_data_path, "gemma-7b-it-awq"),
            # "phi-3-mini": os.path.join(ci_data_path, "phi3-mini-128k-instruct"),
        }

        return ci_paths, hf_paths

The models in hf_paths are downloaded from Hugging Face, and the models in ci_paths are currently uploaded to /data/ortgenai_pytorch_models in the Linux CUDA CI VM.

onnxruntime-genai/.github/workflows/linux-gpu-x64-build.yml

Line 122 in 0f59a90

--volume /data/ortgenai_pytorch_models:/data/ortgenai_pytorch_models \

You can add it to hf_paths for now. If you can also add Qwen to the CIs, that would be helpful.
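A minimal sketch of the requested `hf_paths` change, written as a standalone function so it can run outside the test suite. The OLMo and Qwen repository IDs and dictionary keys below are assumptions chosen for illustration (small checkpoints, in keeping with the note about models over 4B parameters); the PR may use different ones. The `ci_data_path` parameter is also an addition for this sketch and is not part of the original function.

```python
import os


def get_model_paths(ci_data_path=os.path.join("/", "data", "ortgenai_pytorch_models")):
    # Models in hf_paths are downloaded from Hugging Face at test time.
    hf_paths = {
        "phi-2": "microsoft/phi-2",
        # Assumed repo IDs, for illustration only; check the PR for the actual entries.
        "olmo": "allenai/OLMo-1B-hf",
        "qwen-2.5": "Qwen/Qwen2.5-0.5B",
    }

    # Without the CI data volume mounted, only the Hugging Face models are usable.
    if not os.path.exists(ci_data_path):
        return {}, hf_paths

    # Larger (quantized) models live on the CI machine's data volume.
    ci_paths = {
        "llama-2": os.path.join(ci_data_path, "Llama-2-7B-Chat-GPTQ"),
        "llama-3": os.path.join(ci_data_path, "Meta-Llama-3-8B-AWQ"),
        "mistral-v0.2": os.path.join(ci_data_path, "Mistral-7B-Instruct-v0.2-GPTQ"),
        "gemma-7b": os.path.join(ci_data_path, "gemma-7b-it-awq"),
    }
    return ci_paths, hf_paths
```

On a machine without `/data/ortgenai_pytorch_models`, the function returns an empty `ci_paths`, so the new entries in `hf_paths` are the only models the tests would pick up.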

shobrienDMA · 2024-12-12T13:45:15Z

That has been updated as requested. It runs end to end, and I've also added Qwen to the CI list.

kunal-vaishnavi · 2024-12-17T05:35:25Z

Thank you for adding the changes. The end-to-end tests in the CIs appear to be failing due to the transformers version. Can you pin it to v4.44.2?

onnxruntime-genai/test/python/requirements.txt

Line 9 in 9055e68

transformers
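Pinning here means changing that line from `transformers` to `transformers==4.44.2`. As a rough sketch, a check like the following can confirm that a requirements file carries the pin. The helper name `pinned_version` is hypothetical, and it only handles simple `name==version` lines, not the full requirements syntax.

```python
def pinned_version(requirements_text, package):
    """Return the version pinned with '==' for `package`, or None if it is not pinned."""
    for raw_line in requirements_text.splitlines():
        line = raw_line.split("#", 1)[0].strip()  # drop comments and surrounding whitespace
        if not line:
            continue
        name, sep, version = line.partition("==")
        if sep and name.strip().lower() == package.lower():
            return version.strip()
    return None


unpinned = "numpy\ntransformers\n"
pinned = "numpy\ntransformers==4.44.2\n"

print(pinned_version(unpinned, "transformers"))  # None
print(pinned_version(pinned, "transformers"))    # 4.44.2
```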

shobrienDMA · 2024-12-17T16:58:44Z

This should be good to go!

kunal-vaishnavi · 2024-12-18T19:42:48Z

After some further investigation, it appears that the tokenizer CI failure is happening because the tokenizer for OLMo is not currently supported in ONNX Runtime Extensions. Once the support is added, the main branch of ONNX Runtime GenAI can be merged into this PR to integrate the changes.

shobrienDMA and others added 5 commits November 12, 2024 09:09

adding OLMO to the list of Decoder Only Models

c94ee92

Added OLMoModel Class and config.architecture detection, and temporary fake layernorm
Co-authored-by: Tim Costigan <[email protected]>

149cef2

Comment out our hack, modify the OLMo class to attempt to skip the LayerNorm process

df8107f

add olmo builder support

c2071d5

Pulled the layernorm.weight and layernorm.bias values from the config and set then in our override
Co-authored-by: Tim Costigan <[email protected]>

814faa6

shobrienDMA marked this pull request as ready for review November 25, 2024 10:16

Merge branch 'microsoft:main' into shobrien/add-olmo-builder-support

2faf06e

kunal-vaishnavi mentioned this pull request Dec 10, 2024

adding OLMo to the list of Decoder Only Models #1060

Closed

shobrienDMA added 4 commits December 12, 2024 09:27

Merge branch 'main' into shobrien/add-olmo-builder-support

d0bc543

# Conflicts: # src/python/py/models/builder.py

fix new issue where bos_token_id was always set to None if it didn't exist, which caused errors in model_qa.py

1ba81c8

Update readmes and add OLMo and Qwen to the CI tests_utils

30bb7a5

Merge branch 'main' into shobrien/add-olmo-builder-support

45dc21b

shobrienDMA added 2 commits December 17, 2024 10:13

Merge branch 'main' into shobrien/add-olmo-builder-support

37bd4fb

pin transformers to 4.44.2 for the python tests

05f9208
