Downloading Documents Safely
Topograph delivers document download links as signed URLs with a 15-minute lifespan. Each time you call/v2/company
with the requestId
from your original request, the API regenerates fresh signed URLs. We recommend either downloading files immediately or reusing /v2/company
with the stored requestId
when you need a new link.
Automatic PDF Post-Processing
Some registries publish documents in formats such as TIFF, XLS/XLSX, DOC/DOCX, or PPT. When a file requires conversion, the document remains listed in the response with the originalurl
, but the processing status under request.dataStatus.documents
stays in_progress
until the conversion completes (typically ~20 seconds). In critical workflows, you can use the original url
immediately if you do not need the PDF.
Once conversion finishes, the document payload includes:
url
: the original file provided by the registrypdfUrl
: a PDF generated by Topograph (when available)
url
and pdfUrl
are identical. We are working on introducing more granular document statuses in the coming weeks.
Best Practices
- Store request IDs: When you acquire documents, keep the
requestId
so you can refresh signed URLs later without extra cost. - Plan download timing: Because signed URLs expire quickly, build your workflow to download or refresh links promptly.
- Monitor status: Watch
request.dataStatus.documents[docId].status
. If it is stillin_progress
, a background conversion may be running.
AI-Powered Financial Data Extraction
Topograph automatically extracts structured financial data from PDF financial statements using advanced AI technology. This feature requires no additional configuration - extraction happens automatically during document post-processing.How It Works
When a financial statement document is retrieved, our system:- Analyzes the PDF content to determine if it’s a financial statement
- Extracts key financial metrics and metadata if applicable
- Returns the structured data in the
extractedData.financialData
field
Example Response
Here’s an example of a financial statement with extracted data:Key Points
- Automatic processing - No extra API calls or configuration needed
- Non-financial documents - Documents that aren’t financial statements will not have the
extractedData
field - Language agnostic - Works with financial statements in various languages
- Beta feature - The data model may evolve based on user feedback