Downloading Documents Safely
Topograph delivers document download links as signed URLs with a 15-minute lifespan. Each time you call/v2/company with the requestId from your original request, the API regenerates fresh signed URLs. We recommend either downloading files immediately or reusing /v2/company with the stored requestId when you need a new link.
Automatic PDF Post-Processing
Some registers publish documents in various formats that we convert to PDF. Supported formats include:- Images: JPG, JPEG, PNG, TIFF, TIF
- Spreadsheets: CSV, XLS, XLSX
- Documents: DOC, DOCX, RTF, TXT, XPS
- Presentations: PPT, PPTX
url, but the processing status under request.dataStatus.documents stays in_progress until the conversion completes (typically ~20 seconds). You can use the original url immediately if you do not need the PDF.
When conversion is done, the document payload includes:
url: the original file provided by the registerpdfUrl: a PDF generated by Topograph (when available)
url and pdfUrl are identical.
Best Practices
- Store request IDs: When you acquire documents, keep the
requestIdso you can refresh signed URLs later at no extra cost. - Plan download timing: Because signed URLs expire quickly, build your workflow to download or refresh links promptly.
- Monitor status: Watch
request.dataStatus.documents[docId].status. If it is stillin_progress, a background conversion may be running.
AI-Powered Financial Data Extraction
Topograph automatically extracts structured financial data from PDF annual statements. This feature requires no additional configuration—extraction happens during document post-processing. Financial statements will include anextractedData.financialData field with structured metrics.
For detailed information about how it works, the data model, and all available fields, see our Financial Data Extraction documentation.
For a broader picture of how document retrieval fits into the company flow, return to the verification data guide.