bijouvj

ST-summer-2024 Public

Summer internship in ML.

Jupyter Notebook

lmcache-vllm Public

Forked from LMCache/lmcache-vllm

The driver for LMCache core to run in vLLM

Python

lmc-kvikio Public

LMCache Public

Forked from LMCache/LMCache

10x Faster Long-Context LLM By Smart KV Cache Optimizations

Python

TensorRT-LLM Public

Forked from NVIDIA/TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…

C++

gds-pyt Public

PyTorch tensor load/store with GDS

Python