Skip to content

FasterTransformer Backend

FasterTransformer Backend

NOTE

FasterTransformer development has transitioned to TensorRT-LLM. All developers are encouraged to leverage TensorRT-LLM and tensorrtllm_backend to get the latest improvements on LLM Inference. The NVIDIA/FasterTransformer repo will stay up, but will not have further development.

See also

  • FauxPilot - 오픈소스 GitHub CoPilot 서버

Favorite site