A collection of tools and utilities for working with Large Language Models (LLMs), designed to help developers test, benchmark, and optimize their AI applications.
This repository contains a curated collection of tools for LLM development and testing. Each tool is designed to solve specific challenges in working with language models, from performance benchmarking to API management and automation.
A comprehensive benchmarking tool for evaluating and comparing LLM performance across different models and providers.
Location: llm-benchmark/
Features:
- Automated Testing: Run standardized test questions against multiple LLM models
- Performance Metrics: Track response times, token usage, and costs
- Flexible Configuration: Support for multiple providers through environment variables
- Custom Question Sets: Create and manage your own benchmark questions
- Results Visualization: Built-in viewer for analyzing benchmark results
- Code-Only Prompting: Specialized testing for code generation capabilities
Quick Start:
cd llm-benchmark
pip install -r requirements.txt
cp .env.example .env
# Configure your API keys in .env
python verify_setup.py
python run.pyDocumentation:
- Full Documentation - Complete guide to the benchmark tool
- Quick Start Guide - Get up and running in 5 minutes
- Questions Guide - How to create custom benchmark questions
- Code Prompting Guide - Testing code generation
- Changelog - Version history and updates
Each tool in this repository is self-contained with its own documentation and setup instructions. Navigate to the specific project directory for detailed guides.
- Python 3.8 or higher
- API keys for LLM providers you want to use
- Basic understanding of LLM APIs and concepts
Most tools follow a similar setup process:
-
Navigate to the tool directory
cd <tool-name>
-
Install dependencies
pip install -r requirements.txt
-
Configure environment variables
cp .env.example .env # Edit .env with your API keys -
Run the tool
python run.py # or the main script for that tool
This repository is part of a larger ecosystem of LLM tools:
- LLM-API-Key-Proxy - Universal LLM API proxy with resilience and key management
- Mirrobot-agent - AI-powered GitHub bot for automated issue analysis and PR reviews
- Mirrobot-py - Discord bot with LLM integration and advanced features
These projects work seamlessly together to provide a complete LLM development and deployment solution.
Contributions are welcome! Whether you want to:
- Add new tools to the collection
- Improve existing functionality
- Fix bugs or add features
- Enhance documentation
Please follow these guidelines:
- Fork the Repository and clone it locally
- Create a Feature Branch (
git checkout -b feature/amazing-feature) - Make Your Changes, following existing code style and patterns
- Test Your Changes thoroughly
- Commit Your Changes with descriptive commit messages
- Push to Your Fork (
git push origin feature/amazing-feature) - Open a Pull Request with a clear description of your changes
If you need help or have questions:
- Check the documentation for the specific tool you're using
- Visit the GitHub Issues page to report bugs or request features
- Support the project on Ko-fi if you find it useful
This project is open source. Please check individual tool directories for specific license information.
Note: This is an active project with tools being added and updated regularly. Star the repository to stay updated with new additions!