Adding NXD and NKI example #26

EmilyWebber · 2024-10-08T21:41:26Z

Issue #, if available:

No issues.

Description of changes:

This adds a new example of NXD with TinyLLama and a custom NKI kernel for tensor addition. I've confirmed the kernel for accuracy and performance.

Testing:

Please see detailed unit test requirements in the CONTRIBUTING.md

The change is covered by numeric check using nki.baremetal
The change is covered by performance benchmark test using nki.benchmark
The change is covered by end-to-end integration test

Pull Request Checklist

I have filled in all the required field in the template
I have tested locally that all the tests pass
By submitting this pull request, I confirm that my contribution is made under the terms of the MIT-0 license.

JonathanHenson · 2024-10-09T01:24:16Z

nki_university/nki_and_nxd_llama_inference/README.md

@@ -0,0 +1,40 @@
+# TinyLLama inference with NeuronX Distributed and Neuron Kernel Interface
+In this example you can test [TinyLlama](https://huggingface.co/TinyLlama) from Hugging Face on AWS Trainium. This example was built on a trn1.2xlarge machine using this AMI: Deep Learning AMI Neuron (Ubuntu 22.04) 20240927.


Maybe “instance” rather than “machine”

JonathanHenson · 2024-10-09T01:27:44Z

nki_university/nki_and_nxd_llama_inference/README.md

+### Test the script
+Once you've installed all the packages and downloaded your model, you should be ready to test the script. This is done with `python run_llama.py`. 
+
+This script will take at least 30 minutes to complete because it does the following: 1/ compile your model 2/ load to Neuron device 3/ test on Neuron 4/ compare accuracy 5/ run benchmark suite.


Nit: either
“it will do the following:” or the verbs in the list become plural (compiles, loads, etc…)

EmilyWebber added 2 commits October 8, 2024 21:34

Adding readme

03ac74a

Adding NKI and NXD example for NKI university

6e74c83

JonathanHenson reviewed Oct 21, 2024

View reviewed changes

JonathanHenson added the enhancement New feature or request label Oct 21, 2024

Update README.md

9a84b56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding NXD and NKI example #26

Adding NXD and NKI example #26

EmilyWebber commented Oct 8, 2024

JonathanHenson Oct 9, 2024

JonathanHenson Oct 9, 2024

		@@ -0,0 +1,40 @@
		# TinyLLama inference with NeuronX Distributed and Neuron Kernel Interface
		In this example you can test [TinyLlama](https://huggingface.co/TinyLlama) from Hugging Face on AWS Trainium. This example was built on a trn1.2xlarge machine using this AMI: Deep Learning AMI Neuron (Ubuntu 22.04) 20240927.

Adding NXD and NKI example #26

Are you sure you want to change the base?

Adding NXD and NKI example #26

Conversation

EmilyWebber commented Oct 8, 2024

Issue #, if available:

Description of changes:

Testing:

Pull Request Checklist

JonathanHenson Oct 9, 2024

Choose a reason for hiding this comment

JonathanHenson Oct 9, 2024

Choose a reason for hiding this comment