Releases: katanemo/archgw
Releases · katanemo/archgw
0.1.7
What's Changed
- fix code bug in weather forecast demo by @adilhafeez in #340
- Break apart common_types mod by @junr03 in #334
- Break apart metrics into their own module by @junr03 in #335
- add more granular bucket sizes for ttft by @aayushwhiz in #343
- Use intent model from archfc to pick prompt gateway by @nehcgs in #328
Full Changelog: 0.1.6...0.1.7
Release 0.1.6
What's Changed
- add note about hosted arch-fc by @adilhafeez in #308
- correctly map stats port to host by @adilhafeez in #311
- update alertmanager version to v2, remove the merge artifacts by @peterj in #309
- hallucination with log probs by @cotran2 in #281
- updating doc versions, images and cleaning up section for prompt-guard by @salmanap in #320
- 322 add support for pydantic logfire for llm agent tracing by @aayushwhiz in #329
- docs: update README.md by @eltociear in #332
- fix the README for the weather_forecasting demo by @aayushwhiz in #336
- update getting started guide and add llm gateway and prompt gateway samples by @adilhafeez in #330
- update quick start rst to be in sync with readme.md by @adilhafeez in #337
- update cli to 0.1.6 by @adilhafeez in #338
New Contributors
Full Changelog: 0.1.5...0.1.6
release 0.1.5 🎉
What's Changed
- use envoy to publish traces by @adilhafeez in #270
- publish docker image by @adilhafeez in #283
- publish docker image on release only by @adilhafeez in #284
- updating readme and docs with note about Arch-Function by @salmanap in #285
- update readme with python version by @adilhafeez in #302
- use fix prompt guards by @adilhafeez in #303
- Remove blanket unused imports from the common crate by @junr03 in #292
- add schema change to use enum in arch_config by @adilhafeez in #304
- remove dependency on docker-compose when starting up archgw by @adilhafeez in #305
- release 0.1.5 by @adilhafeez in #307
Full Changelog: 0.1.3...0.1.5
Release 0.1.3 🎉
What's Changed
- updating README with PH launch results by @salmanap in #268
- Add in tpot by @aayushwhiz in #269
- move custom tracer to llm filter by @adilhafeez in #267
- updating docs to reflect changes in 0.1.2 like tracing via signoz and… by @salmanap in #271
- updated website with PH badge by @salmanap in #272
- updating website to track analytics by @salmanap in #273
- fix start time of llm filter by @adilhafeez in #278
- release 0.1.3 by @adilhafeez in #280
Full Changelog: 0.1.2...0.1.3
Release 0.1.2 🎉 Latest
Built packages
- pypi: archgw/0.1.2
- pypi: archgw_modelserver/0.1.2
- docker image: b35f75d6f5b0 (katanemo/archgw:latest)
What's Changed
- fix ports and update README for paths to agent/chat by @salmanap in #240
- fixing ports in arch_config for demos by @salmanap in #241
- updating README logo by @salmanap in #242
- add http files for llm and prompt gateway for local testing by @adilhafeez in #244
- retry embeddings fetch by @adilhafeez in #245
- chore: update stream_context.rs by @eltociear in #248
- add prefill and test by @cotran2 in #236
- add support for jaeger tracing by @adilhafeez in #229
- update docs with weather_forecast path by @adilhafeez in #253
- add requirements to readme by @adilhafeez in #249
- obfuscate auth header by @adilhafeez in #254
- removing hard depdency on mistral keys in weather forecast demos by @salmanap in #256
- update rust toolchain to 1.82 by @adilhafeez in #255
- Add service to stream custom otel traces to otel-collector by @adilhafeez in #262
- Update arch stats by @aayushwhiz in #250
- move grafana and prometheus to shared by @adilhafeez in #265
- release 0.1.2 by @adilhafeez in #266
Full Changelog: 0.1.0...0.1.2
Release 0.1.0 🎉
0.1.0 release
Built packages
- pypi: archgw/0.1.0
- pypi: archgw_modelserver/0.1.1
- docker image: 64ec02e9d1d5 (katanemo/archgw:latest)
What's Changed
- Add initial scaffolding by @junr03 in #1
- add checks on pr to ensure code compiles and passes static analysis checks. by @aayushwhiz in #4
- Add stats API by @junr03 in #5
- Parse katanemo config using serde/yaml package by @adilhafeez in #6
- Setup pre-commit so it runs locally before every git push by 🎉@adilhafeez in #12
- add embedding store by @adilhafeez in #10
- Add testing to GitHub workflow by @aayushwhiz in #15
- Add initial logic to send prompts to LLM API by @junr03 in #9
- Delete example code, and move prod code into functional separation by @junr03 in #16
- Address comments from clean up PR by @junr03 in #17
- Use open-message-format to serialize and deserialize embeddings api by @adilhafeez in #18
- update pre-commit to only run fmt on intelligent-prompt-gateway package by @adilhafeez in #22
- refactor code a bit by @adilhafeez in #21
- Add initial integration style tests by @junr03 in #20
- Improve error handling by @junr03 in #23
- Add basic workflow logic by @adilhafeez in #24
- Clean up imports by @junr03 in #25
- Remove unnecessary clones by @junr03 in #26
- Remove top level container in config and use snake-case for config by @adilhafeez in #27
- update open-message-format@1e838f3 by @adilhafeez in #30
- Improve demo experience by @adilhafeez in #28
- Simplify Entity struct by @junr03 in #33
- Add support for local llm (mistral 7b) by @adilhafeez in #31
- Add Ratelimit config by @junr03 in #32
- Add the Ratelimit module by @junr03 in #34
- Move shared types into their own crate by @junr03 in #41
- Add Ratelimit on request tokens by @junr03 in #44
- Fix wasm load error by @junr03 in #45
- Add function calling support using bolt-fc-1b by @adilhafeez in #35
- More idiomatic conversion by @junr03 in #47
- Add small clarification to function calling demo by @junr03 in #48
- Resetting instead of resuming by @junr03 in #39
- Improve prompt target intent matching by @adilhafeez in #51
- rename envoy-1 => bolt-1 by @adilhafeez in #53
- Improve service names by @adilhafeez in #54
- add bolt guard config by @adilhafeez in #55
- Propagate enums from prompt-target to bolt-fc by @adilhafeez in #56
- Add ability to stream a response by @junr03 in #50
- add ability to override default values from config by @adilhafeez in #58
- send all tools when sending request to arch-fc by @adilhafeez in #59
- Add ability to define clusters in config by @adilhafeez in #61
- demos for network copilot and sql analyzer by @Sampreeth-sarma in #57
- use openai standard response in arch-fc and in gradio client by @adilhafeez in #62
- fix webui url and dependencies by @adilhafeez in #66
- Delete unused envoy.yaml file by @junr03 in #65
- remove embeddings config from config by @adilhafeez in #64
- Include param default in parameters by @adilhafeez in #68
- fix similarity bug by @Sampreeth-sarma in #63
- move demo functions out of model_server by @adilhafeez in #67
- Docs branch - v1 of our tech docs by @salmanap in #69
- [Kan-103] add support toxic/jailbreak model by @cotran2 in #49
- Improve response handling by @adilhafeez in #71
- add details on how to use grafana dashboards by @adilhafeez in #72
- improve cluster not configured error by @adilhafeez in #73
- remove guard config json by @cotran2 in #70
- Adil/fix salman docs by @salmanap in #75
- fix demos code by @adilhafeez in #76
- Remove OMF by @adilhafeez in #78
- Add support for multiple LLM Providers by @junr03 in #60
- improve logging of api failure by @adilhafeez in #79
- add readme and docker build by @adilhafeez in #81
- send history to bolt fc model by @adilhafeez in #84
- Added Float type to the function parameter values by @Sampreeth-sarma in #77
- Fix bug in PromptGuard configuration by @junr03 in #80
- Revert "Revert "Add support for multiple LLM Providers (#60)"" by @junr03 in #83
- Integrate Arch-Function-Calling-1.5B model by @adilhafeez in #85
- V1 docs push by @salmanap in #86
- expose access logs from envoy by @adilhafeez in #89
- add bolt support by @adilhafeez in #90
- Add supported parameter type, validation and tests by @adilhafeez in #88
- Salmanap/docs v1 push by @salmanap in #92
- rename envoyfilter => arch by @adilhafeez in #91
- add support for 3b model by @adilhafeez in #96
- changes prometheus target to arch so that data collection works by @aayushwhiz in #98
- add precommit check by @adilhafeez in #97
- update config by @adilhafeez in #93
- Remove unnecessary envoy.yaml by @junr03 in #99
- remove method type by @adilhafeez in #101
- Rename bolt_config to arch_config by @junr03 in #100
- Fold function_resolver into model_server by @adilhafeez in #103
- add messages in params when making api calls by @adilhafeez in #104
- arch schema validator by @adilhafeez in #105
- simplify developer getting started experience by @salmanap in #102
- rename archgw_model_sever => model_server by @adilhafeez in #106
- ArchFC endpoint integration by @cotran2 in #94
- update path for arch_config.yaml file by @adilhafeez in #107
- change root to config in config_generator by @aayushwhiz in #108
- fix fc integration by @cotran2 in #110
- change default stat_prefix from ingress_http to arch by @aayushwhiz in #109
- add support for default target by @adilhafeez in #111
- Add the ability to use LLM Providers from the Arch config by @junr03 in #112
- Remove optional PromptGuards from Stream Context by @junr03 in #113
- archgw cli by @salmanap in #117
- Cotran/hallucination integration by @cotran2 in #115
- Remove hardcoded Access Key Envvars by @junr03 in #118
- load_models checks for device before getting the BGE or NLI model loa… by @salmanap in #119
- fix embeddings not found bug by @adilhafeez in #120
- [BUG FIX] Add missing chat completions request hint by @junr03 in #116
- Clean up Embeddings Store by @junr03 in #121
- fix pkg_resources error by @adilhafeez in ...