Run local LLMs using SmallBASIC

chrisws · December 31, 2025, 6:17am

I’ve created a new module that uses llama.cpp to run LLM models locally.

Here’s an example:

github.com/smallbasic/smallbasic.plugins

llama/samples/adventure.bas

llama.cpp

import llm

' Configuration
const n_ctx = 5000
const n_batch = 512
const model_path = "models/Qwen_Qwen2.5-1.5B-Instruct-GGUF-Q4/qwen2.5-1.5b-instruct-q4_k_m.gguf"
const max_turns = 10

' Initialize two separate LLM instances
const storyteller = llm.llama(model_path, n_ctx, n_batch)
const player = llm.llama(model_path, n_ctx, n_batch)

' Configure Storyteller (creative, descriptive)
storyteller.set_max_tokens(150)
storyteller.set_temperature(0.8)
storyteller.set_top_k(80)
storyteller.set_top_p(0.95)
storyteller.set_min_p(0.05)
storyteller.set_penalty_repeat(1.2)
storyteller.set_penalty_last_n(128)

This file has been truncated. show original

The module has build options for CPU or GPU. CPU almost kind of works with the smallest models. I bought a new graphics card and this gave a huge improvement.

Topic		Replies	Views
Your software written in SmallBASIC Software	0	87	June 6, 2025
Installation of SmallBASIC on W11 pc Site Feedback	1	72	August 10, 2025
SmallBASIC 12.33 News	6	69	March 13, 2026
Bug reports and other issues Bugs	0	33	June 6, 2025
News and Announcements News	0	33	June 6, 2025

Run local LLMs using SmallBASIC

Related topics