Skip to content

Actions: flashinfer-ai/flashinfer

Automatically bump version and release Python wheels

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
320 workflow runs
320 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

triton: cascade kernels (#396)
Automatically bump version and release Python wheels #220: Commit 2496f5b pushed by yzh119
July 29, 2024 03:23 25s main
July 29, 2024 03:23 25s
feat: support non-contiguous (packed) input for prefill kernels (#404)
Automatically bump version and release Python wheels #219: Commit 68c3719 pushed by yzh119
July 29, 2024 03:18 22s main
July 29, 2024 03:18 22s
feat: add llama 3.1 style rope (#401)
Automatically bump version and release Python wheels #218: Commit 4c89dec pushed by yzh119
July 27, 2024 10:37 24s main
July 27, 2024 10:37 24s
misc: add vllm to adoption list (#399)
Automatically bump version and release Python wheels #217: Commit 73a764b pushed by yzh119
July 26, 2024 08:01 22s main
July 26, 2024 08:01 22s
ci: add torch 12.4 to the matrix configuration (#398)
Automatically bump version and release Python wheels #216: Commit de16915 pushed by yzh119
July 26, 2024 07:31 25s main
July 26, 2024 07:31 25s
perf: slight optimization on merge states (#313)
Automatically bump version and release Python wheels #215: Commit 701c813 pushed by yzh119
July 24, 2024 06:59 33s main
July 24, 2024 06:59 33s
refactor: use c++17 style structure bindings (#393)
Automatically bump version and release Python wheels #214: Commit 2ab2bca pushed by yzh119
July 24, 2024 06:51 17s main
July 24, 2024 06:51 17s
ci: setup mypy, pylint and cpplint (#389)
Automatically bump version and release Python wheels #213: Commit 5da6577 pushed by yzh119
July 21, 2024 07:25 17s main
July 21, 2024 07:25 17s
misc: clang-format prefill.cuh (#388)
Automatically bump version and release Python wheels #212: Commit 8e377ba pushed by yzh119
July 21, 2024 04:28 16s main
July 21, 2024 04:28 16s
hotfix: fix the bug in #386 (#387)
Automatically bump version and release Python wheels #211: Commit dc3f184 pushed by yzh119
July 21, 2024 00:32 19s main
July 21, 2024 00:32 19s
bugfix: fix sampling API's behavior on cu118 (#386)
Automatically bump version and release Python wheels #210: Commit 0cd4994 pushed by yzh119
July 21, 2024 00:24 19s main
July 21, 2024 00:24 19s
chore(main): release 0.1.1 (#381)
Automatically bump version and release Python wheels #209: Commit b64d5c9 pushed by yzh119
July 20, 2024 09:15 5h 35m 1s main
July 20, 2024 09:15 5h 35m 1s
bugfix: Fix invalid kernel configuration for sm86 (#385)
Automatically bump version and release Python wheels #208: Commit cdac577 pushed by yzh119
July 20, 2024 09:09 20s main
July 20, 2024 09:09 20s
feat: expose decoupled kv-cache to pytorch api (#383)
Automatically bump version and release Python wheels #207: Commit 457a0ae pushed by yzh119
July 20, 2024 01:25 23s main
July 20, 2024 01:25 23s
perf: use stmatrix in epilogue for sm90+ (#380)
Automatically bump version and release Python wheels #206: Commit c6f20d1 pushed by yzh119
July 19, 2024 02:43 23s main
July 19, 2024 02:43 23s
refactor: decouple kv-cache storage (#379)
Automatically bump version and release Python wheels #205: Commit d68a408 pushed by yzh119
July 18, 2024 08:38 17s main
July 18, 2024 08:38 17s
doc: update documentation to v0.1.0 (#378)
Automatically bump version and release Python wheels #204: Commit 9cb28de pushed by yzh119
July 18, 2024 05:50 16s main
July 18, 2024 05:50 16s
chore(main): release 0.1.0 (#373)
Automatically bump version and release Python wheels #203: Commit 58b68d0 pushed by yzh119
July 17, 2024 08:29 5h 37m 17s main
July 17, 2024 08:29 5h 37m 17s
feat: expose pytorch api for block sparse attention (#375)
Automatically bump version and release Python wheels #202: Commit 4bba6fa pushed by yzh119
July 17, 2024 08:28 26s main
July 17, 2024 08:28 26s
doc: fix typo (#376)
Automatically bump version and release Python wheels #201: Commit b2d5994 pushed by yzh119
July 13, 2024 18:31 26s main
July 13, 2024 18:31 26s
feat: Fused GPU sampling kernel for joint top-k & top-p sampling (#374)
Automatically bump version and release Python wheels #200: Commit 6e028eb pushed by yzh119
July 13, 2024 03:43 23s main
July 13, 2024 03:43 23s
feat: Add mask to merge_state_in_place (#372)
Automatically bump version and release Python wheels #199: Commit e14fa81 pushed by yzh119
July 13, 2024 02:09 25s main
July 13, 2024 02:09 25s
chore(main): release 0.0.9 (#359)
Automatically bump version and release Python wheels #198: Commit 17a5f1b pushed by yzh119
July 12, 2024 05:54 5h 30m 17s main
July 12, 2024 05:54 5h 30m 17s
refactor: reduce binary size by making kv_layout an argument instea…
Automatically bump version and release Python wheels #197: Commit 024a79f pushed by yzh119
July 12, 2024 05:31 29s main
July 12, 2024 05:31 29s
bugfix: fix the decode kernel segfault in cudagraph mode (#368)
Automatically bump version and release Python wheels #196: Commit c69cfab pushed by yzh119
July 11, 2024 06:16 24s main
July 11, 2024 06:16 24s