Click any tag below to further narrow down your results
Links
Tokenflood is a tool designed for load testing instruction-tuned large language models (LLMs). It allows users to define various parameters like prompt lengths and request rates without needing specific prompt data, making it easier to assess latency and performance across different providers and configurations. Users should be cautious of potential costs when using pay-per-token services.