Bundata on Azure

This page explains how to run Bundata on Microsoft Azure and integrate it with Azure services. Bundata can read documents from and write results to Azure Blob Storage, and it authenticates through Azure identity services, so document processing stays within your Azure environment.

Overview

Bundata on Azure supports:

  • Ingestion from Azure Blob Storage — Read documents from Blob containers for partitioning, extraction, and enrichment.
  • Azure identity — Use Microsoft Entra ID (formerly Azure AD) and managed identities so Bundata accesses only the resources you allow.
  • Deployment options — Run Bundata as a managed service that connects to your Azure subscription, or deploy in your own VNet for full control.

Documentation is organized by cloud provider. Use the main docs navigation to switch between AWS, Azure, and Google Cloud as needed.

Getting started

  1. Sign up for Bundata — Create an account and choose a plan that supports Azure connectivity.
  2. Configure an Azure Blob source — In the Bundata UI or via API, add an Azure Blob connector with your storage account, container, and credentials (or managed identity).
  3. Run a pipeline — Define a schema, run extraction and enrichment, and send results to Vector Catalog, another Blob container, or a different destination.
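
As a rough illustration of step 2, the sketch below assembles and validates the information an Azure Blob connector needs (storage account, container, optional path prefix, and authentication method). The payload shape, the field names, and the `build_blob_connector` helper are hypothetical and are not taken from the Bundata API Reference; only the Azure naming rules and the public-cloud endpoint format are standard Azure behavior.

```python
import re

# Azure naming rules:
# storage account names are 3-24 characters, lowercase letters and digits only.
_ACCOUNT_RE = re.compile(r"[a-z0-9]{3,24}")
# Container names are 3-63 characters of lowercase letters, digits, and hyphens,
# start and end with a letter or digit, and contain no consecutive hyphens.
_CONTAINER_RE = re.compile(r"[a-z0-9](?:-?[a-z0-9])+")


def build_blob_connector(account, container, prefix="",
                         auth="managed_identity", account_key=None):
    """Return a connector payload. Field names are illustrative assumptions."""
    if not _ACCOUNT_RE.fullmatch(account):
        raise ValueError(f"invalid storage account name: {account!r}")
    if not (3 <= len(container) <= 63 and _CONTAINER_RE.fullmatch(container)):
        raise ValueError(f"invalid container name: {container!r}")
    if auth not in ("managed_identity", "account_key"):
        raise ValueError(f"unsupported auth method: {auth!r}")
    if auth == "account_key" and not account_key:
        raise ValueError("account_key is required when auth='account_key'")

    payload = {
        "type": "azure_blob",  # hypothetical connector type name
        # Endpoint format for the Azure public cloud; sovereign clouds differ.
        "account_url": f"https://{account}.blob.core.windows.net",
        "container": container,
        "prefix": prefix,
        "auth": {"method": auth},
    }
    if auth == "account_key":
        payload["auth"]["account_key"] = account_key
    return payload


if __name__ == "__main__":
    print(build_blob_connector("contosodocs", "raw-documents", prefix="invoices/"))
```

Validating names before submitting the connector catches the most common setup mistakes (uppercase letters or underscores in account and container names) without a round trip to Azure. Check the API Reference for the actual request format.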

Data and storage

  • Source documents — Store raw PDFs, DOCX, and other files in Azure Blob Storage. Bundata reads from the containers and paths you configure.
  • Output — Write smart bites, embeddings, and metadata back to Blob Storage, or to Bundata’s Vector Catalog and other supported destinations.
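  • Security — Authenticate with storage account keys or with Microsoft Entra ID (formerly Azure AD), for example a managed identity, and grant least-privilege access. Prefer managed identity where possible, since it avoids storing long-lived keys.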
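
Before pointing Bundata at a container, it can help to confirm that an identity with read-only access can actually see the source documents. The sketch below is one way to do that with the Azure SDK for Python (`azure-identity` and `azure-storage-blob`); it is independent of Bundata. The account and container names are placeholders, the extension filter is an assumption based on the file types mentioned above, and the identity is assumed to hold a read-only data role such as Storage Blob Data Reader on the container.

```python
def blob_account_url(account):
    """Blob endpoint for the Azure public cloud (sovereign clouds differ)."""
    return f"https://{account}.blob.core.windows.net"


def select_documents(blob_names, extensions=(".pdf", ".docx")):
    """Keep blob names whose extension matches, ignoring case."""
    return [n for n in blob_names if n.lower().endswith(tuple(extensions))]


def list_source_documents(account, container, prefix=""):
    """List candidate source documents using identity-based access (no keys)."""
    # Imported lazily so the helpers above work without the Azure SDK installed.
    from azure.identity import DefaultAzureCredential
    from azure.storage.blob import BlobServiceClient

    # DefaultAzureCredential picks up a managed identity when running in Azure,
    # and falls back to environment variables or an Azure CLI login locally.
    credential = DefaultAzureCredential()
    service = BlobServiceClient(account_url=blob_account_url(account),
                                credential=credential)
    container_client = service.get_container_client(container)
    names = [blob.name
             for blob in container_client.list_blobs(name_starts_with=prefix or None)]
    return select_documents(names)


if __name__ == "__main__":
    # Placeholder names; replace with your own storage account and container.
    for name in list_source_documents("contosodocs", "raw-documents", "invoices/"):
        print(name)
```

If the listing fails with an authorization error, the identity is likely missing a data-plane role on the container; management-plane roles such as Contributor do not grant access to blob data.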

Integration with Azure services

  • Azure Blob Storage — Primary storage for source and (optionally) output data.
  • Microsoft Entra ID (formerly Azure Active Directory) — Authentication and authorization for storage and other Azure APIs used by Bundata.
  • Azure Virtual Network (VNet) — Bundata can run inside your virtual network when you need private connectivity, lower latency, and locked-down access.

For detailed networking, compliance, and region options, refer to your Bundata account documentation or contact support.

Try Bundata on Azure

  • Quickstart — Run your first pipeline with an Azure Blob source.
  • API Reference — Configure connectors and batch jobs via API.
  • Overview — Key functionality and use cases.

Other clouds