Welcome to local_llm_kit’s documentation!

local_llm_kit is a Python package that provides an OpenAI-like interface for local language models. It allows you to run language models locally while maintaining compatibility with OpenAI’s API structure.

Features

OpenAI-compatible API interface
Support for multiple model backends (Transformers, llama.cpp)
Chat completion API
Function calling capability
Streaming responses
JSON mode output
Memory management
Custom prompt formatting
Extensive documentation and tutorials
Performance optimization guides
Community-driven development
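
To make the function calling feature more concrete: in OpenAI-style APIs, each callable is described by a JSON schema, and the model replies with a function name plus JSON-encoded arguments that your code then executes. The sketch below walks through that round trip in plain Python without calling local_llm_kit at all. The get_weather function, the TOOLS schema, the REGISTRY mapping, and the hand-written tool_call dict are all hypothetical, and whether local_llm_kit accepts and emits exactly this shape is an assumption to check against the API Reference.

```python
import json

# Hypothetical local function the model may ask us to run.
def get_weather(city: str) -> str:
    return f"Sunny in {city}"

# Tool description in the OpenAI "tools" format
# (assumed, not confirmed, to carry over to local_llm_kit).
TOOLS = [
    {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Return a short weather summary for a city.",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }
]

# Registry mapping tool names to Python callables.
REGISTRY = {"get_weather": get_weather}

def dispatch(tool_call: dict) -> str:
    """Run the function named in an OpenAI-style tool call."""
    name = tool_call["function"]["name"]
    if name not in REGISTRY:
        raise KeyError(f"unknown tool: {name}")
    # Arguments arrive as a JSON string, not as a dict.
    kwargs = json.loads(tool_call["function"]["arguments"])
    return REGISTRY[name](**kwargs)

# Hand-written stand-in for what a model might return.
tool_call = {
    "id": "call_0",
    "type": "function",
    "function": {"name": "get_weather", "arguments": '{"city": "Paris"}'},
}
result = dispatch(tool_call)
print(result)
```

Keeping the registry separate from the schema means an unknown or misspelled function name fails loudly instead of executing arbitrary code.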

Quick Installation

pip install local-llm-kit

Quick Example

from local_llm_kit import LLMClient

# Initialize the client
client = LLMClient(model="llama2")

# Chat completion
response = client.chat.completions.create(
    model="llama2",
    messages=[
        {"role": "user", "content": "What is the capital of France?"}
    ]
)

print(response.choices[0].message.content)
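
The final line reads the reply through response.choices[0].message.content, which implies a nested response object like the one OpenAI's chat completions return. The following is a minimal sketch of that shape using stand-in dataclasses. Message, Choice, ChatCompletion, and first_reply are illustrative names, not classes exported by local_llm_kit, and the real objects may carry more fields.

```python
from dataclasses import dataclass, field
from typing import List

# Stand-in types mirroring the OpenAI-style response layout (assumed).
@dataclass
class Message:
    role: str
    content: str

@dataclass
class Choice:
    index: int
    message: Message
    finish_reason: str = "stop"

@dataclass
class ChatCompletion:
    model: str
    choices: List[Choice] = field(default_factory=list)

def first_reply(response: ChatCompletion) -> str:
    """Return the text of the first choice, failing clearly if there is none."""
    if not response.choices:
        raise ValueError("response contains no choices")
    return response.choices[0].message.content

# Hand-built response standing in for what the client might return.
fake = ChatCompletion(
    model="llama2",
    choices=[Choice(index=0, message=Message(role="assistant", content="Paris"))],
)
print(first_reply(fake))
```

Wrapping the attribute chain in a small helper gives one place to handle an empty choices list rather than hitting an IndexError at each call site.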

Getting Help

If you need help using local_llm_kit, you have several options:

Check the Tutorials for step-by-step guides
Read the API Reference for detailed API documentation
Look through Examples for common use cases
Visit our GitHub Issues page
Join our community discussions

Contributing

We welcome contributions! Please see our Contributing guide for details on how to:

Report bugs
Suggest features
Submit pull requests
Improve documentation

License

local_llm_kit is released under the MIT License. See the LICENSE file for more details.