v0.8.0
Release date: 2023-01-24 07:20:48
Latest release of nebuly-ai/optimate: chatllama0.0.4 (2023-03-27 21:47:20)
nebullvm 0.8.0 Release Notes
This major release fixes multiple bugs and adds two new functions for saving and loading models.
New Features
- Implements two new functions for loading and saving inference learners.
Bug Fixes
- Fixed ONNXRuntime models being loaded on the wrong device after optimization.
- Fixed TensorRT behaviour when using dynamic shapes.
- Significantly improved the performance of TensorRT with the ONNX interface.
- Limited the GPU memory used by TensorFlow to avoid memory issues while optimizing TensorFlow models.
- Fixed some issues with unit tests and added checks to ensure that an optimized model with dynamic shape works properly.
- Removed setuptools from the TensorRT installation: the newer version no longer needs it, and it was causing issues with the Azure pipelines.
speedster 0.2.0 Release Notes
Major release for Speedster adding the load_model and save_model functions.
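As a rough illustration of the new functions, the sketch below saves an already optimized model to a directory and loads it back. The helper name save_and_reload is hypothetical, and the argument order (model first, then path for save_model; path only for load_model) is an assumption based on the project documentation rather than something stated in these notes.

```python
from pathlib import Path


def save_and_reload(optimized_model, save_dir):
    """Save an optimized model to `save_dir`, then load it back from disk.

    Hypothetical helper: the save_model/load_model signatures used here
    are assumed, not confirmed by these release notes.
    """
    # Imported lazily so the helper can be defined without speedster installed.
    from speedster import load_model, save_model

    save_dir = Path(save_dir)
    # Make sure the target directory exists before saving.
    save_dir.mkdir(parents=True, exist_ok=True)

    # Assumed call shape: save_model(model, path) and load_model(path).
    save_model(optimized_model, str(save_dir))
    return load_model(str(save_dir))
```

A typical call would pass the object returned by Speedster's optimization step together with a target directory, for example save_and_reload(optimized_model, "model_save_path").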
New Features
- Improved the logs
- Save and load model functions can be imported directly from speedster
- Brand new documentation, moved directly into the GitHub repository and built using mkdocs
Bug Fixes
- Fixed version compatibility issue on Colab
- Added the missing port forwarding to the docker run command in the notebooks README.
Contributors
- Valerio Sofi (@valeriosofi)
- Diego Fiori (@diegofiori)
- Leonardo Zecchin (@ZecchinLeonardo)
- Arian Ghasemi (@arianGh1)