llm-fillin
LLM-powered slot filling + tool orchestration for Ruby.
Turn natural language into structured, validated function calls.
llm-fillin provides the glue between an LLM and your backend services.
Install
bundle install
Configure
Set your OpenAI API key:
export OPENAI_API_KEY=sk-...
Run the demo
ruby examples/run_create_toy.rb
Example Conversation
YOU: I want a red race car toy for $12
AI: Got it! You want a red race car toy for $12. Could you please provide a name for this toy?
YOU: Supra
AI: You want a red race car toy named "Supra" for $12. Just to confirm, is the category of the toy "car"?
YOU: what other categories are there?
AI: The toy categories available are plush, puzzle, doll, car, lego, and other. Would you like to proceed with the "car" category for the red race car toy named Supra?
YOU: make it lego
AI: You want a red Lego race car toy priced at $12. Just to confirm, is the toy name "Supra"?
YOU: yes
✅ Toy created: Supra (lego, red) - $12.0 | ID TOY-6A99ED
Quick Start (using as a gem)
Once you’ve added the gem to your Gemfile:
gem "llm-fillin"
and bundled:
bundle install
you can use it in your app like this:
require "llm/fillin"
require "json"
# 1. Create a registry and register your tools
registry = LLM::Fillin::Registry.new
CREATE_TOY_V1 = {
  "type" => "object",
  "properties" => {
    "name"     => { "type" => "string" },
    "category" => { "type" => "string", "enum" => %w[plush puzzle doll car lego other] },
    "price"    => { "type" => "integer" },
    "color"    => { "type" => "string" }
  },
  "required" => %w[name category price color]
}
registry.register!(
  name: "create_toy", version: "v1",
  schema: CREATE_TOY_V1,
  description: "Create a toy with name, category, price, and color.",
  handler: ->(args, ctx) {
    key = "chat-#{ctx[:thread_id]}-#{SecureRandom.hex(6)}"
    { id: "TOY-#{SecureRandom.hex(4)}", idempotency_key: key }.merge(args)
  }
)
# 2. Set up the adapter, store, and orchestrator
adapter = LLM::Fillin::OpenAIAdapter.new(
  api_key: ENV.fetch("OPENAI_API_KEY"),
  model: "gpt-4o-mini"
)
store = LLM::Fillin::StoreMemory.new
orchestrator = LLM::Fillin::Orchestrator.new(
  adapter: adapter,
  registry: registry,
  store: store
)
# 3. Send messages through the orchestrator
thread_id = "demo-thread"
tenant_id = "demo-tenant"
actor_id  = "demo-user"
loop do
  print "YOU: "
  user_input = STDIN.gets.to_s.strip # to_s guards against nil on EOF (Ctrl-D)
  break if user_input.empty?
  res = orchestrator.step(
    thread_id: thread_id,
    tenant_id: tenant_id,
    actor_id: actor_id,
    messages: [{ role: "user", content: user_input }]
  )
  case res[:type]
  when :assistant
    puts "AI: #{res[:text]}"
  when :tool_ran
    puts "✅ Tool ran: #{res[:tool_name]} => #{res[:result].to_json}"
  end
end
Example Run
YOU: create me a toy for $15
AI: Got it! What’s the name of the toy?
YOU: Pikachu
AI: Great, should the category be plush, puzzle, doll, car, lego, or other?
YOU: plush
AI: And what color should Pikachu be?
YOU: yellow
✅ Tool ran: create_toy => {"id":"TOY-5A9F","idempotency_key":"chat-demo-thread-...","name":"Pikachu","category":"plush","price":1500,"color":"yellow"}
How it works (Architecture)
When you interact with the assistant, several components collaborate:
1) Registry
The Registry is where tools (functions) are registered so the LLM can call them.
Each tool has:
- name + version
- schema (JSON Schema that defines the inputs)
- description (human-readable prompt for the LLM)
- handler (Ruby lambda that executes the action)
Example
registry.register!(
  name: "create_toy", version: "v1",
  schema: CREATE_TOY_V1,
  description: "Create a toy with name, category, price, and color.",
  handler: ->(args, ctx) { Toy.create!(args.merge(ctx)) }
)
2) Validators
All arguments are validated with json_schemer before a handler is called:
- Required fields must exist
- Types must match (string, integer, enum, etc.)
- No unexpected fields are allowed
This keeps your backend safe.
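As a rough illustration (not the gem's internals), here is how json_schemer performs that check against the CREATE_TOY_V1 schema from the Quick Start. Note that plain JSON Schema only rejects unexpected fields when "additionalProperties" => false is present, so presumably the library adds that, or your schemas should include it:
require "json_schemer"

schemer = JSONSchemer.schema(CREATE_TOY_V1)

schemer.valid?({ "name" => "Supra", "category" => "lego",
                 "price" => 1200, "color" => "red" })      # => true
schemer.valid?({ "name" => "Supra", "price" => "cheap" })  # => false
# Failures carry machine-readable types ("required", "enum", ...):
schemer.validate({ "name" => "Supra" }).map { |e| e["type"] }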
3) Idempotency
When creating resources, an idempotency key is generated with:
"chat-<thread_id>-#{SecureRandom.hex(6)}"This ensures the same request doesn’t create duplicates.
4) Orchestrator
The orchestrator coordinates everything:
- Supplies registered tools to the LLM
- Tracks conversation state and tool results
- Parses LLM output (text vs. function call)
- Validates arguments and calls the handler
- Stores tool results in memory
This is what lets the AI ask for missing fields naturally.
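A simplified sketch of what one step does; every method name on the adapter, registry, and store below is an assumption about the internals, not the gem's actual source:
# Pseudocode sketch only; internal method names are stand-ins.
def step(thread_id:, tenant_id:, actor_id:, messages:)
  history  = store.history(thread_id) + messages
  response = adapter.chat(messages: history, tools: registry.tool_specs)

  if response.tool_call?                       # the LLM chose a function
    tool   = registry.fetch(response.tool_name)
    errors = tool.validate(response.arguments) # json_schemer under the hood
    return { type: :assistant, text: "Missing/invalid: #{errors.inspect}" } if errors.any?

    ctx    = { thread_id: thread_id, tenant_id: tenant_id, actor_id: actor_id }
    result = tool.handler.call(response.arguments, ctx)
    store.record(thread_id, result)
    { type: :tool_ran, tool_name: response.tool_name, result: result }
  else
    { type: :assistant, text: response.text }  # plain conversational turn
  end
end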
5) Adapters
Adapters handle communication with the LLM.
Currently there’s an OpenAIAdapter that:
- Sends prompts, tools, and messages to OpenAI
- Parses responses (tool_calls or the legacy function_call)
- Wraps tool results for reuse in the conversation
Other adapters could be added later for Anthropic, Mistral, or local LLMs.
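Writing one mostly means matching whatever interface the orchestrator calls on OpenAIAdapter. A skeleton might look like this (the chat signature is a guess at that contract, not a documented API):
# Hypothetical adapter skeleton.
class AnthropicAdapter
  def initialize(api_key:, model: "claude-3-5-haiku-latest")
    @api_key = api_key
    @model   = model
  end

  # Takes the running message list plus tool specs; should return either
  # plain assistant text or a parsed tool call, like OpenAIAdapter does.
  def chat(messages:, tools:)
    # ... call the Anthropic Messages API, translate the tool specs, and
    # normalize the response into the shape the orchestrator expects ...
    raise NotImplementedError
  end
end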
6) StoreMemory
A simple in-memory store for tool results by thread.
In production, this could be replaced by Redis or a database.
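A Redis-backed replacement might look roughly like this; the record/history method names are assumptions matching the sketch above, not the gem's documented store interface:
require "redis"
require "json"

# Hypothetical drop-in for StoreMemory backed by Redis lists.
class StoreRedis
  def initialize(redis: Redis.new)
    @redis = redis
  end

  # Append a tool result to the thread's history.
  def record(thread_id, result)
    @redis.rpush("llm-fillin:#{thread_id}", result.to_json)
  end

  # Return all tool results recorded for a thread.
  def history(thread_id)
    @redis.lrange("llm-fillin:#{thread_id}", 0, -1).map { |j| JSON.parse(j) }
  end
end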
End-to-End Flow
- User says: "I want a red race car toy for $12".
- LLM identifies intent: "create_toy_v1".
- LLM fills slots: price = 1200, color = red. Missing: name, category.
- LLM asks user: “What’s the name?” → “Supra”.
- LLM asks user: “Is the category car?” → “make it lego”.
- Arguments validated by Validators.
- Handler executes with args + context + idempotency key.
- Result returned: structured hash (toy ID, name, category, etc.).
- Assistant replies: ✅ Toy created.
Use in your app
- Define tools (schemas + handlers).
- Register them in the Registry.
- Wire the Orchestrator with an Adapter and a Store.
- Pass messages into orchestrator.step.
- Handle either assistant replies or tool execution results.
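Put together in a web app, that might look roughly like this (the controller, params, helpers, and ORCHESTRATOR constant are all illustrative):
# Hypothetical Rails-style wiring; ORCHESTRATOR is the orchestrator built
# in the Quick Start, stored wherever suits your app.
class ChatController < ApplicationController
  def create
    res = ORCHESTRATOR.step(
      thread_id: params[:thread_id],
      tenant_id: current_tenant_id,
      actor_id:  current_user_id,
      messages:  [{ role: "user", content: params[:message] }]
    )

    case res[:type]
    when :assistant then render json: { reply: res[:text] }
    when :tool_ran  then render json: { reply: "Done", result: res[:result] }
    end
  end
end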
License
MIT