20 Dec 21:29

salmanap

0.1.7 Latest

What's Changed

fix code bug in weather forecast demo by @adilhafeez in #340
Break apart common_types mod by @junr03 in #334
Break apart metrics into their own module by @junr03 in #335
add more granular bucket sizes for ttft by @aayushwhiz in #343
Use intent model from archfc to pick prompt gateway by @nehcgs in #328

Full Changelog: 0.1.6...0.1.7

Contributors

junr03, adilhafeez, and 2 other contributors


06 Dec 23:48

adilhafeez

Release 0.1.6

What's Changed

add note about hosted arch-fc by @adilhafeez in #308
correctly map stats port to host by @adilhafeez in #311
update alertmanager version to v2, remove the merge artifacts by @peterj in #309
hallucination with log probs by @cotran2 in #281
updating doc versions, images and cleaning up section for prompt-guard by @salmanap in #320
322 add support for pydantic logfire for llm agent tracing by @aayushwhiz in #329
docs: update README.md by @eltociear in #332
fix the README for the weather_forecasting demo by @aayushwhiz in #336
update getting started guide and add llm gateway and prompt gateway samples by @adilhafeez in #330
update quick start rst to be in sync with readme.md by @adilhafeez in #337
update cli to 0.1.6 by @adilhafeez in #338

New Contributors

@peterj made their first contribution in #309

Full Changelog: 0.1.5...0.1.6

Contributors

salmanap, peterj, and 4 other contributors


26 Nov 21:29

adilhafeez

Release 0.1.5 🎉

What's Changed

use envoy to publish traces by @adilhafeez in #270
publish docker image by @adilhafeez in #283
publish docker image on release only by @adilhafeez in #284
updating readme and docs with note about Arch-Function by @salmanap in #285
update readme with python version by @adilhafeez in #302
use fix prompt guards by @adilhafeez in #303
Remove blanket unused imports from the common crate by @junr03 in #292
add schema change to use enum in arch_config by @adilhafeez in #304
remove dependency on docker-compose when starting up archgw by @adilhafeez in #305
release 0.1.5 by @adilhafeez in #307

Full Changelog: 0.1.3...0.1.5

Contributors

salmanap, junr03, and adilhafeez


18 Nov 01:13

adilhafeez

Release 0.1.3 🎉

What's Changed

updating README with PH launch results by @salmanap in #268
Add in tpot by @aayushwhiz in #269
move custom tracer to llm filter by @adilhafeez in #267
updating docs to reflect changes in 0.1.2 like tracing via signoz and… by @salmanap in #271
updated website with PH badge by @salmanap in #272
updating website to track analytics by @salmanap in #273
fix start time of llm filter by @adilhafeez in #278
release 0.1.3 by @adilhafeez in #280

Full Changelog: 0.1.2...0.1.3

Contributors

salmanap, adilhafeez, and aayushwhiz


13 Nov 08:00

adilhafeez

Release 0.1.2 🎉

Built packages

pypi: archgw/0.1.2
pypi: archgw_modelserver/0.1.2
docker image: b35f75d6f5b0 (katanemo/archgw:latest)

What's Changed

fix ports and update README for paths to agent/chat by @salmanap in #240
fixing ports in arch_config for demos by @salmanap in #241
updating README logo by @salmanap in #242
add http files for llm and prompt gateway for local testing by @adilhafeez in #244
retry embeddings fetch by @adilhafeez in #245
chore: update stream_context.rs by @eltociear in #248
add prefill and test by @cotran2 in #236
add support for jaeger tracing by @adilhafeez in #229
update docs with weather_forecast path by @adilhafeez in #253
add requirements to readme by @adilhafeez in #249
obfuscate auth header by @adilhafeez in #254
removing hard dependency on mistral keys in weather forecast demos by @salmanap in #256
update rust toolchain to 1.82 by @adilhafeez in #255
Add service to stream custom otel traces to otel-collector by @adilhafeez in #262
Update arch stats by @aayushwhiz in #250
move grafana and prometheus to shared by @adilhafeez in #265
release 0.1.2 by @adilhafeez in #266

Full Changelog: 0.1.0...0.1.2

Contributors

salmanap, adilhafeez, and 3 other contributors


31 Oct 02:03

adilhafeez

Release 0.1.0 🎉

0.1.0 release

Built packages

pypi: archgw/0.1.0
pypi: archgw_modelserver/0.1.1
docker image: 64ec02e9d1d5 (katanemo/archgw:latest)

What's Changed

Add initial scaffolding by @junr03 in #1
add checks on pr to ensure code compiles and passes static analysis checks. by @aayushwhiz in #4
Add stats API by @junr03 in #5
Parse katanemo config using serde/yaml package by @adilhafeez in #6
Setup pre-commit so it runs locally before every git push by @adilhafeez in #12
add embedding store by @adilhafeez in #10
Add testing to GitHub workflow by @aayushwhiz in #15
Add initial logic to send prompts to LLM API by @junr03 in #9
Delete example code, and move prod code into functional separation by @junr03 in #16
Address comments from clean up PR by @junr03 in #17
Use open-message-format to serialize and deserialize embeddings api by @adilhafeez in #18
update pre-commit to only run fmt on intelligent-prompt-gateway package by @adilhafeez in #22
refactor code a bit by @adilhafeez in #21
Add initial integration style tests by @junr03 in #20
Improve error handling by @junr03 in #23
Add basic workflow logic by @adilhafeez in #24
Clean up imports by @junr03 in #25
Remove unnecessary clones by @junr03 in #26
Remove top level container in config and use snake-case for config by @adilhafeez in #27
update open-message-format@1e838f3 by @adilhafeez in #30
Improve demo experience by @adilhafeez in #28
Simplify Entity struct by @junr03 in #33
Add support for local llm (mistral 7b) by @adilhafeez in #31
Add Ratelimit config by @junr03 in #32
Add the Ratelimit module by @junr03 in #34
Move shared types into their own crate by @junr03 in #41
Add Ratelimit on request tokens by @junr03 in #44
Fix wasm load error by @junr03 in #45
Add function calling support using bolt-fc-1b by @adilhafeez in #35
More idiomatic conversion by @junr03 in #47
Add small clarification to function calling demo by @junr03 in #48
Resetting instead of resuming by @junr03 in #39
Improve prompt target intent matching by @adilhafeez in #51
rename envoy-1 => bolt-1 by @adilhafeez in #53
Improve service names by @adilhafeez in #54
add bolt guard config by @adilhafeez in #55
Propagate enums from prompt-target to bolt-fc by @adilhafeez in #56
Add ability to stream a response by @junr03 in #50
add ability to override default values from config by @adilhafeez in #58
send all tools when sending request to arch-fc by @adilhafeez in #59
Add ability to define clusters in config by @adilhafeez in #61
demos for network copilot and sql analyzer by @Sampreeth-sarma in #57
use openai standard response in arch-fc and in gradio client by @adilhafeez in #62
fix webui url and dependencies by @adilhafeez in #66
Delete unused envoy.yaml file by @junr03 in #65
remove embeddings config from config by @adilhafeez in #64
Include param default in parameters by @adilhafeez in #68
fix similarity bug by @Sampreeth-sarma in #63
move demo functions out of model_server by @adilhafeez in #67
Docs branch - v1 of our tech docs by @salmanap in #69
[Kan-103] add support toxic/jailbreak model by @cotran2 in #49
Improve response handling by @adilhafeez in #71
add details on how to use grafana dashboards by @adilhafeez in #72
improve cluster not configured error by @adilhafeez in #73
remove guard config json by @cotran2 in #70
Adil/fix salman docs by @salmanap in #75
fix demos code by @adilhafeez in #76
Remove OMF by @adilhafeez in #78
Add support for multiple LLM Providers by @junr03 in #60
improve logging of api failure by @adilhafeez in #79
add readme and docker build by @adilhafeez in #81
send history to bolt fc model by @adilhafeez in #84
Added Float type to the function parameter values by @Sampreeth-sarma in #77
Fix bug in PromptGuard configuration by @junr03 in #80
Revert "Revert "Add support for multiple LLM Providers (#60)"" by @junr03 in #83
Integrate Arch-Function-Calling-1.5B model by @adilhafeez in #85
V1 docs push by @salmanap in #86
expose access logs from envoy by @adilhafeez in #89
add bolt support by @adilhafeez in #90
Add supported parameter type, validation and tests by @adilhafeez in #88
Salmanap/docs v1 push by @salmanap in #92
rename envoyfilter => arch by @adilhafeez in #91
add support for 3b model by @adilhafeez in #96
changes prometheus target to arch so that data collection works by @aayushwhiz in #98
add precommit check by @adilhafeez in #97
update config by @adilhafeez in #93
Remove unnecessary envoy.yaml by @junr03 in #99
remove method type by @adilhafeez in #101
Rename bolt_config to arch_config by @junr03 in #100
Fold function_resolver into model_server by @adilhafeez in #103
add messages in params when making api calls by @adilhafeez in #104
arch schema validator by @adilhafeez in #105
simplify developer getting started experience by @salmanap in #102
rename archgw_model_sever => model_server by @adilhafeez in #106
ArchFC endpoint integration by @cotran2 in #94
update path for arch_config.yaml file by @adilhafeez in #107
change root to config in config_generator by @aayushwhiz in #108
fix fc integration by @cotran2 in #110
change default stat_prefix from ingress_http to arch by @aayushwhiz in #109
add support for default target by @adilhafeez in #111
Add the ability to use LLM Providers from the Arch config by @junr03 in #112
Remove optional PromptGuards from Stream Context by @junr03 in #113
archgw cli by @salmanap in #117
Cotran/hallucination integration by @cotran2 in #115
Remove hardcoded Access Key Envvars by @junr03 in #118
load_models checks for device before getting the BGE or NLI model loa… by @salmanap in #119
fix embeddings not found bug by @adilhafeez in #120
[BUG FIX] Add missing chat completions request hint by @junr03 in #116
Clean up Embeddings Store by @junr03 in #121
fix pkg_resources error by @adilhafeez in ...


Contributors

salmanap, junr03, and 7 other contributors
