Intelligent Document Processing Skills
Overview
Use the File I/O skill set to let an iDialogue Agent process Salesforce Files for intelligent document processing (IDP). These skills give the agent a simple document workflow:
process_fileprepares a file for downstream use.read_fileretrieves processed file content in bounded chunks.write_filesaves summaries, notes, or other agent-authored outputs back to the file workspace.
The processed representation is stored internally as markdown. Users do not need to ask for markdown explicitly.
Who This Is For
Salesforce administrators, solution architects, and developers who configure agents to review contracts, invoices, case attachments, intake forms, and other unstructured business documents.
Requirements
- Agent model must be GPT-5.4.
- Agent must have one or more File I/O skills enabled:
process_file,read_file,write_file. - The request context must provide the target Salesforce File version as a
contentVersionId. - Background execution is recommended for longer document-processing runs or multi-step document workflows.
What Each Skill Does
process_file
Use process_file when the agent needs to extract, normalize, or prepare a file for downstream use.
- Best used when the agent is seeing a file for the first time.
- Creates the canonical processed artifact for the file workspace.
- Returns status and preview metadata instead of the entire file body.
read_file
Use read_file when the agent needs to inspect, quote, summarize, or continue working with a processed file.
- Reads the canonical processed artifact or another saved markdown artifact.
- Returns bounded content so large documents do not flood the conversation.
- Can optionally ensure the file has already been processed before reading.
write_file
Use write_file when the agent needs to save a derived document, notes, or summary.
- Persists agent-authored markdown back to the file workspace.
- Useful for outputs such as
summary.md,notes.md, or a refreshedfile.md. - Supports overwrite control so agents do not replace artifacts unintentionally.
Common Inputs
| Skill | Required Input | Common Optional Inputs | Typical Result |
|---|---|---|---|
process_file |
contentVersionId |
contentDocumentId, intent, outputName, force |
Canonical processed artifact plus metadata |
read_file |
contentVersionId |
name, offset, maxChars, ensureProcessed |
Bounded artifact body plus metadata |
write_file |
contentVersionId, markdown |
name, overwrite |
Saved artifact plus checksum and byte count |
Admin Setup Checklist
- Add the File I/O skills your agent needs:
process_file,read_file, and optionallywrite_file. - Set the agent model to GPT-5.4.
- Make sure your invocation flow can provide the target
contentVersionId. - Update the system prompt so the agent knows when to process a file, when to read it, and when to save derived outputs.
- Use background execution for long-running document jobs or multi-file workflows.
Prompt Guidance Snippet
Use this in your system or skill prompt to steer document-processing behavior:
When a user asks you to review, extract, analyze, or summarize a Salesforce File,
call process_file with the file's contentVersionId if the file has not been prepared yet.
Use read_file to inspect the processed content in bounded chunks.
Use write_file only when you need to save a summary, notes, or another derived document
back to the file workspace. Do not ask the user to provide markdown format.
Examples
Contract Review
- A user asks the agent to review a contract attached to a Salesforce record.
- The agent calls
process_filewith the contractcontentVersionId. - The agent calls
read_fileto inspect the processed content. - The agent answers the user and optionally saves a
summary.mdartifact withwrite_file.
Invoice Intake
- A user asks the agent to extract billing details from a PDF invoice.
- The agent calls
process_fileto prepare the invoice. - The agent calls
read_fileto pull the relevant sections. - The agent returns the extracted values and can save reconciliation notes with
write_file.
Policy or Case Attachment Follow-Up
- A user asks for a concise summary of a policy document or case attachment.
- The agent uses
read_fileif the file has already been processed. - If the file is not ready yet, the agent processes it first, then reads the relevant section.
- The agent can save a reusable notes artifact for future follow-up.
Recommended Prompt Pattern
- Use
process_filefirst when the document has not been normalized yet. - Use
read_filefor inspection, quoting, summarization, or follow-up analysis. - Use
write_fileonly for derived outputs you want the agent to save. - Treat the internal markdown representation as an implementation detail, not as end-user language.
Troubleshooting
- Agent is answering without using the file:
- Confirm the agent has the File I/O skills enabled and the prompt tells it to call them.
read_filecannot find content:- Process the file first, or configure the agent to use
ensureProcessed=truewhen appropriate.
- Process the file first, or configure the agent to use
write_filefails because the artifact already exists:- Choose a different artifact name or explicitly allow overwrite when replacement is intended.
- Long documents take multiple steps:
- Use background execution and let the agent save summaries or notes instead of trying to return the entire document body in one response.