All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
- Issue #25 Fix: The
web_searchtool now returns an error when unknown parameters are passed (e.g.,engineinstead ofengines). Previously, unknown parameters were silently ignored, causing confusion when users specified the wrong field name.
enginesparameter type changed fromstringtoarrayin schema to match actual API- Updated a3s-search integration to v1.0.0
-
Built-in Git Client: New
gittool with auto-install support for Windows, macOS, and Linux. Downloads official pre-built git binaries to~/.local/git/bin/when git is not available - no package manager required.Full git operations:
status,log,branch,checkout,diff,stash,remote,worktree -
Git Convenience Methods: Python SDK (
session.git(...)) and Node SDK (session.git(...)) convenience methods for git operations.
- Updated all system prompts to reference "A3S Code" instead of "Claude Code"
- Updated skill references to use
a3s-lab/code-skills
-
Document Parser: Removed
composite_document_parseranddocumentmodules and all related code. This feature was not fully implemented and has been removed to simplify the codebase. -
Agentic Search/Parse Tools: Removed
agentic_searchandagentic_parsebuilt-in tools. -
Git Worktree Tool: Replaced by the new unified
gittool withworktreesubcommand.
- Tool Count: Updated built-in tool count from 15 to 16 to reflect new git and box tools.
- Documentation: Updated all documentation to reflect new tool names and capabilities.
-
XLSB (Excel Binary) Support: Added calamine-based BIFF12 parsing for XLSB files with proper cell value extraction, supporting Float, Int, Bool, DateTime, DateTimeIso, and DurationIso types. Significantly improves table fidelity for .xlsb files.
-
HWPX Table Extraction: Added structured table extraction from Korean HWPX documents. Parses
tbl/tr/tcXML hierarchy and includesstructured_payloadfortables[]output. -
Vision OCR Provider: New OCR backend supporting OpenAI-compatible vision APIs for document OCR fallback.
document_parser { ocr { enabled = true model = "openai/gpt-4.1-mini" api_key = "sk-..." base_url = "https://api.openai.com/v1" # optional prompt = "Extract all text from this document..." max_images = 8 dpi = 144 } }
Provider priority: External provider > Vision API (if model+api_key configured) > Builtin tesseract
-
Tabular Query Intent Detection: Automatically detects when queries relate to tables (keywords: table, column, row, spreadsheet, excel, csv, cell, data, record, etc.) and boosts table line matches by +10 keyword hits plus 1.3x relevance multiplier.
-
Heading Inheritance Boost: When search matches appear under headings that also match the query, those matches receive a relevance boost (up to 1.3x). Looks backwards to find the closest preceding heading.
DocumentOcrConfigextended with new fields:provider: Option<String>- Backend selection ("vision" or "builtin")base_url: Option<String>- Custom API endpointapi_key: Option<String>- API authentication
- Added
calamine = "0.26"for XLSB parsing - Added
reqwest/blockingfeature for Vision API HTTP calls
- Test assertion:
paged_text_blocks_reflow_two_column_preserves_paragraph_breaks- Corrected expected string "Parser metadata now tracks OCR" vs "Parser metadata now tracks OCR backend"
-
Phase 1 structured result surfaces:
structured_payloadexposed inagentic_parseoutput and metadata- Table payloads in stable machine-readable form
- Page-level data in
agentic_parseoutput and metadata - Stable
tables[],pages[],elements[]outputs
-
Phase 2 PDF extraction improvements:
- lopdf position-aware text extraction
- Reduced dependence on weak text fallbacks
- Position-aware table detection
-
agentic_searchenhancements:- Chunk context consumption
- Tabular content consumption
- Page numbers and locators support
ParsedDocumentextended withtables: Vec<StructuredTable>andpages: Vec<PageInfo>
- Windows shell compatibility improvements
- Runtime session header support for OpenAI configs
- Cross-platform environment variable expansion in tests
- Enhanced agent config, document parser, LLM, tools, and SDKs
- Host shell environment propagation to tool commands
- Zhipu AI client (
ZhipuClientformerlyGlmClient) - Duplicate tool call circuit breaker
- Streaming fallback support
agentic_parseskill
- Session-local skill registries
- Tool schema hardening
- Slash command output restoration