Skip to content

v0.16.0

Latest
Compare
Choose a tag to compare
@XprobeBot XprobeBot released this 18 Oct 11:40
5f7dea4

What's new in 0.16.0 (2024-10-18)

These are the changes in inference v0.16.0.

New features

  • FEAT: Adding support for awq/gptq vLLM inference to VisionModel such as Qwen2-VL by @cyhasuka in #2445
  • FEAT: Dynamic batching for the state-of-the-art FLUX.1 text_to_image interface by @ChengjieLi28 in #2380
  • FEAT: added MLX for qwen2.5-instruct by @qinxuye in #2444

Enhancements

Documentation

New Contributors

Full Changelog: v0.15.4...v0.16.0