Interactive Chat
System:
You are a helpful AI assistant powered by LlamaPHP.
Text Generation
Output
Output will appear here...
Real-time Streaming
Streaming shows tokens as they are generated, using the new generateStream() method.
Loading...
Ready to stream
Text Embeddings
Note: Your model must support embeddings (look for models with "all-MiniLM", "bge", "e5" in name).
Embedding 1
[]
Dimension: -
Embedding 2
[]
Dimension: -
Cosine Similarity
0.00
Higher values indicate more similar meaning
Configuration & Status
qwen3-0.6b-q4_k_m.gguf
/app/web/../models/qwen3-0.6b-q4_k_m.gguf
Size: 0.45 GB
Last modified: 2025-12-30 18:03:14
Model file found and accessible.
llama-cli
/usr/local/bin/llama-cli
Executable: Yes
Permissions: 0755
Binary found and executable.
Test the API endpoints directly: