Craft Agent includes built-in tools for working with common document formats. Convert between formats, extract text, merge files, create reports, and process images, all from within your conversation.
Quick Reference
| Tool | What it does | Formats |
|---|---|---|
| markitdown | Convert any document to Markdown | .docx, .xlsx, .pptx, .pdf, .html, .ipynb, and more |
| pdf-tool | Extract text, merge, split, get info | .pdf |
| xlsx-tool | Read, write, export spreadsheets | .xlsx, .csv |
| docx-tool | Create and edit Word documents | .docx |
| pptx-tool | Read and inspect presentations | .pptx |
| img-tool | Resize, convert, get metadata | .png, .jpg, .webp, .gif, .svg |
| doc-diff | Compare two documents | Any two files |
| ical-tool | Read and parse calendar files | .ics |
Common Tasks
Convert a Document to Text
The most common operation is extracting readable text from any document.
Work with PDFs
Process Spreadsheets
Create Word Documents
Process Images
Compare Documents
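To give a feel for what a comparison of two text documents produces, here is a minimal sketch using Python's standard `difflib` module. It is only an illustration of a unified diff, not the implementation or interface of doc-diff; the function name `diff_text` and its parameters are hypothetical.

```python
import difflib


def diff_text(old: str, new: str, old_name: str = "old", new_name: str = "new") -> str:
    """Return a unified diff between two pieces of text (empty string if identical)."""
    diff_lines = difflib.unified_diff(
        old.splitlines(keepends=True),
        new.splitlines(keepends=True),
        fromfile=old_name,
        tofile=new_name,
    )
    return "".join(diff_lines)


if __name__ == "__main__":
    # Lines starting with "-" were removed, lines starting with "+" were added.
    print(diff_text("Title\nDraft text\n", "Title\nFinal text\n"))
```

For binary formats such as .docx, a comparison like this would only make sense after the documents have been converted to text.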
Parse Calendar Files
ical-tool reads .ics calendar files and extracts event details (title, date, time, location, attendees).
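The fields listed above map onto standard iCalendar properties (`SUMMARY`, `DTSTART`, `DTEND`, `LOCATION`, `ATTENDEE`). The following is a rough sketch of that extraction in plain Python, assuming a simple, well-formed file. It is not how ical-tool is implemented; `parse_events` and the output keys are hypothetical names, and the sketch ignores time zones, recurrence, and quoted parameter values.

```python
# Map iCalendar property names to friendlier keys (the keys are an assumption).
FIELDS = {"SUMMARY": "title", "DTSTART": "start", "DTEND": "end", "LOCATION": "location"}


def parse_events(ics_text: str) -> list[dict]:
    """Extract basic event details from iCalendar text."""
    # Unfold continuation lines: a line starting with a space or tab
    # continues the previous line.
    lines: list[str] = []
    for raw in ics_text.splitlines():
        if raw[:1] in (" ", "\t") and lines:
            lines[-1] += raw[1:]
        else:
            lines.append(raw)

    events: list[dict] = []
    current = None
    for line in lines:
        if line == "BEGIN:VEVENT":
            current = {"attendees": []}
        elif line == "END:VEVENT" and current is not None:
            events.append(current)
            current = None
        elif current is not None and ":" in line:
            # "NAME;PARAM=x:value" -> property name and value.
            # Simplification: assumes no colon inside the parameters.
            name_and_params, value = line.split(":", 1)
            name = name_and_params.split(";", 1)[0].upper()
            if name == "ATTENDEE":
                current["attendees"].append(value)
            elif name in FIELDS:
                current[FIELDS[name]] = value
    return events
```

Date values are returned as the raw strings from the file (for example `20240102T100000Z`); converting them to local times is left out of this sketch.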
What if the agent can't read a file directly?
If the agent encounters a binary file it can’t read natively (like .docx or .pptx), it automatically falls back to markitdown to convert the document to readable text. You don’t need to do anything — just ask the agent to read the file.
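The fallback described here can be pictured as "try to read the file as text, and convert it only if that fails". The sketch below shows that pattern under stated assumptions: `read_with_fallback` is a hypothetical helper, and `convert` stands in for whatever converter is available (the agent uses markitdown; how it invokes it is not shown in this page).

```python
from pathlib import Path
from typing import Callable


def read_with_fallback(path: Path, convert: Callable[[Path], str]) -> str:
    """Return the file's text, converting it first if it is not plain UTF-8."""
    try:
        return path.read_text(encoding="utf-8")
    except UnicodeDecodeError:
        # Formats such as .docx and .pptx are binary archives, not UTF-8 text,
        # so hand the file to the converter instead (hypothetical callable).
        return convert(path)
```

Passing the converter in as a parameter keeps the sketch independent of any particular conversion library.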
Where do created files go?
When the agent creates or converts documents, the output files are saved in the current working directory (or a path you specify). The agent will tell you the exact file path after creation.
Can I chain operations?
Absolutely. You can ask for multi-step workflows like “Read this PDF, extract the financial data, and create an Excel spreadsheet with just the quarterly revenue figures.” The agent will use multiple tools in sequence.
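As an illustration of what such a chain does conceptually, the sketch below takes text that has already been extracted from a report, keeps only the quarterly revenue lines, and writes them out as CSV. Everything here is an assumption made for the example: the line format (`Q1 2024 revenue 1,200.50`), the function names, and the sample figures are invented, and the agent's real tools are not called.

```python
import csv
import io
import re

# Assumed line format for this sketch: "Q1 2024 revenue 1,200.50"
ROW = re.compile(r"^(Q[1-4] \d{4})\s+revenue\s+([\d,]+(?:\.\d+)?)$", re.IGNORECASE)


def quarterly_revenue(text: str) -> list[tuple[str, float]]:
    """Step 2 of the chain: keep only lines that look like quarterly revenue."""
    rows = []
    for line in text.splitlines():
        match = ROW.match(line.strip())
        if match:
            amount = float(match.group(2).replace(",", ""))
            rows.append((match.group(1), amount))
    return rows


def to_csv(rows: list[tuple[str, float]]) -> str:
    """Step 3 of the chain: serialize the filtered rows as CSV text."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["quarter", "revenue"])
    writer.writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    # Step 1 (text extraction from the PDF) is assumed to have happened already.
    extracted = "Annual report\nQ1 2024 revenue 1,200.50\nHeadcount 42\n"
    print(to_csv(quarterly_revenue(extracted)))
```

Each step consumes the previous step's output, which is the same shape a multi-tool request takes when the agent runs it for you.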