Skip to content

kdb products

Late data

kdb products

Home
kdb+ and q
kdb Insights SDK
kdb Insights SDK
- About
- Free Trial
- Prerequisites
- Core
  Core
  - About
  - Install
  - Object storage
    Object storage
    
    About
    
    Quickstart
    
    Caching
    
    Examples
  - SQL
    SQL
    
    About
    
    SQL Reference
    SQL Reference
    
    Operators
    
    Functions
    
    Data and Literals
    
    Select Statements
    
    Table Creation
    
    ANSI SQL Compliance
  - Postgres SQL Interface
  - REST API
    REST API
    
    Client
    Client
    
    About
    
    Quickstart
    
    Workflows
    
    Examples
    Examples
    
    Async
    
    Follow redirects
    
    Response headers
    
    Timeouts
    
    Azure API Management
    
    GCP Identity Aware Proxy
    
    Server
    Server
    
    About
    
    Quickstart
    
    API reference
    
    Examples
    Examples
    
    customers
    
    queryclient
    
    queryserver
    
    queryworker
    
    OpenAPI Sample
  - Google BigQuery API
    Google BigQuery API
    
    About
    
    Quickstart
    
    Main
    
    Discovery
    
    Query
    
    Projects
    
    Datasets
    
    Tables
    
    Tabledata
    
    Helpers
    
    Configuration
    
    API
    
    Troubleshooting
  - Packaging
    Packaging
    
    About
    
    Quickstart
    
    Examples
    Examples
    
    About the examples
    
    Basic Tick
    
    Hello C
    
    Labeling
  - Logging
    Logging
    
    About
    
    Quickstart
    
    API reference
  - Release notes
    Release notes
    
    Latest
    
    Previous
- Database
  Database
  - Overview
  - Data Configuration
    Data Configuration
    
    Overview
    
    Routing
    
    Assembly
    Assembly
    
    Database
    
    Schema
    
    Storage
    
    Query
    
    Stream
    
    Aggregation
    
    User Defined Analytics
    
    Advanced
    Advanced
    
    Overview
    
    Query scaling
    
    Authorization
    Authorization
    
    Custom IPC Authorization
    
    Custom HTTP Authorization
    
    Query IPC Externally
  - Data Storage
    Data Storage
    
    Overview
    
    Storage Tiering
    
    Object Storage
    
    Delete Rows
    
    Backup and Restore
    
    Event Hooks
  - Data Import
    Data Import
    
    Import Overview
    
    Initial Import
    Initial Import
    
    Overview
    
    Prerequisites
    
    Quickstart
    
    Initial Import Process
    
    Schema Creation
    
    Troubleshooting
    
    Batch Ingest
  - Data Query
    Data Query
    
    Overview
    
    Purviews
    
    Scope
    
    Late data
    
    Manual EOD Trigger
    
    Reference data
    
    Routing
    
    Queuing, retries, and timeouts
    
    Resilience
    
    Logging
    
    Troubleshooting
    
    Advanced
    Advanced
    
    Query existing object storage
  - Querying methods
    Querying methods
    
    REST vs QIPC
    
    SQL
  - Monitoring
  - Best practices
    Best practices
    
    Late Data
    
    Manual EOD Trigger
    
    Performance
  - Deploying
    Deploying
    
    Overview
    
    Docker
    Docker
    
    Database
    
    Basic
    
    Metrics
    
    Kubernetes
    Kubernetes
    
    Database
  - Downgrading
  - Glossary
- Stream Processor
  Stream Processor
  - About Streaming Data
  - Quickstart
    Quickstart
    
    Docker
    
    Kubernetes
  - Writing
  - Running
  - Configuration
  - Insights Ingest
  - Examples
    Examples
    
    Static file
    
    Batch S3 ingestion
    
    Kafka
    Kafka
    
    Getting started
    
    Setup Kafka
    
    Basic ingestion
    
    Setup Kafka TLS
    
    Ingestion with TLS
    
    PostgreSQL Querying
    
    Pipeline Replicas
    
    Stateful operators
    
    Enriching streams
    
    Windowing on event time
    
    Windowing on processing time
    
    kdb+ tick (callback)
    
    Reader Triggering
  - Concepts
    Concepts
    
    Checkpoints and recovery
    
    Determinism
    
    Glob patterns
    
    Scaling
    
    State
- Reliable Transport
  Reliable Transport
  - About
  - Quickstart
    Quickstart
    
    About
    
    Docker
    
    Kubernetes
  - Publishers
    Publishers
    
    Overview
  - Subscribers
  - Interfaces
    Interfaces
    
    Getting started
    
    C
    C
    
    Using the C interface
    
    C samples
    
    Java
    Java
    
    Using the Java interface
    
    Java samples
    
    Python
    Python
    
    Using the Python interface
    
    Python samples
    
    q (rt.qpk)
  - Examples
    Examples
    
    Publishing to Enterprise using q
    
    Recovering archived logs
    
    Running RT outside of a container
  - Configuration
    Configuration
    
    Overview
    
    Diagnostics
    
    Monitoring
  - Administration
    Administration
    
    Soft reset
    
    Hard reset
- KX for Databricks
  KX for Databricks
- Release notes
  Release notes
  - Latest
  - Previous
- Extras
  Extras
  - Tutorials
    Tutorials
    
    Streaming to a web-socket client
  - Machine Learning
    Machine Learning
    
    About
    
    Quickstart
    Quickstart
    
    Docker
    
    Kubernetes
    
    Examples
    Examples
    
    Model Generation & Deployment
kdb Insights Enterprise
kdb Insights Enterprise
- Home
- About
  About
  - Overview
  - Interfaces
- Architecture
- Install
  Install
  - Overview
  - Free Trial
    Free Trial
    
    7 day Free Trial
    
    Product Tour
  - Azure Marketplace
    Azure Marketplace
    
    Offers
    
    Prerequisites
    
    Permissions
    
    User Node Pool Sizing
    
    KX Managed
    KX Managed
    
    About
    
    Install
    
    Login
    
    Billing
    Billing
    
    About
    
    FAQ
    
    Security
    
    Licensing
    
    Troubleshooting
    
    Release Notes
    Release Notes
    
    Latest
    
    Previous
    
    License only
    
    Private offers
    
    Azure Integrations
    Azure Integrations
    
    Azure Data Factory
    
    Microsoft Entra ID
    Microsoft Entra ID
    
    Microsoft Entra ID
    
    Microsoft Entra Keycloak Composite Roles
    
    Azure Monitoring
    Azure Monitoring
    
    Alert Configuration
    
    Workbook Configuration
    
    PowerBI
    
    Kubernetes system upgrade
    
    Support
    Support
    
    KX Support
    
    Azure Secrets
  - Standalone
    Standalone
    
    Infrastructure
    Infrastructure
    
    Managed K8S
    Managed K8S
    
    Prerequisites
    
    Terraform
    Terraform
    
    Deployment
    Deployment
    
    Overview
    
    Cloud provider
    Cloud provider
    
    GCP
    
    AWS
    
    Azure
    
    Troubleshooting
    
    Interacting with Kubernetes
    
    DNS setup
    
    On-Prem OpenShift
    On-Prem OpenShift
    
    Prerequisites
    
    On-Prem K8S
    On-Prem K8S
    
    Prerequisites
    
    Installation
    Installation
    
    Installing
    
    Validation
    
    Upgrading
    
    Air-gapped environments
- Use
  Use
  - Web Interface
    Web Interface
    
    Get Started
    
    Overview
    Overview
    
    Log in
    
    Web Interface Overview
    
    System Information
    
    Databases
    Databases
    
    Create & manage
    
    Database Settings
    
    Schema Settings
    
    Stream Settings
    
    Database Resources
    
    Deploying
    
    Pipelines
    Pipelines
    
    Import wizard
    
    Build & manage
    
    Test
    
    Settings
    
    Operators
    Operators
    
    Overview
    
    Readers
    
    Writers
    
    Functions
    
    Decoders
    
    Encoders
    
    Transform
    
    Stats
    
    Windows
    
    Machine Learning
    
    String Utilities
    
    Troubleshooting
    
    Queries
    Queries
    
    Queries index
    
    Query window
    
    Query panel
    
    Scratchpad
    
    Scratchpad using q
    
    Scratchpad using Python
    
    Query APIs
    
    Views
    Views
    
    Views index
    
    Quickstart guide to Views
    
    Guide to building Views
    
    Packages
    Packages
    
    Packages
    
    Diagnostics
    Diagnostics
    
    Diagnosing deployments
    
    Guided walkthroughs
    Guided walkthroughs
    
    Index
    
    Ingest and Query
    Ingest and Query
    
    Database
    
    Object Storage
    
    Kafka
    
    SQL Database
    
    Protocol Buffer
    
    Query
    
    Visualize
    Visualize
    
    Build a View
    
    Maps
    
    Streaming
    
    Tutorials
    Tutorials
    
    Index
    
    Finance
    Finance
    
    Backtest trading strategies
    
    Run ML model in real-time
    
    Manufacturing
    
    Parquet
  - Configure a Database
    Configure a Database
    
    Overview
    
    Configuration options
    Configuration options
    
    Overview
    
    Routing
    
    Package
    Package
    
    Database
    
    Schema
    
    Storage
    
    Routing
    
    Query
    
    Stream
    
    Reference
    
    Aggregation
    
    User defined analytics
    
    Advanced
    Advanced
    
    Overview
    
    Query scaling
    
    Monitoring
    
    Best practices
    Best practices
    
    Late data
    
    Manual EOD Trigger
    
    Performance
    
    Upgrading
    
    Glossary
  - Data Storage
    Data Storage
    
    Overview
    
    Storage Tiering
    
    Object Storage
    
    Delete Rows
    
    Backup and Restore
    
    Event Hooks
  - Data Import
    Data Import
    
    Import Overview
    
    Initial Import
    Initial Import
    
    Overview
    
    Prerequisites
    
    Quickstart
    
    Initial Import Process
    
    Schema Creation
    
    Troubleshooting
    
    Batch Ingest
  - Ingest & Transform
    Ingest & Transform
    
    Overview
    
    Examples
    Examples
    
    Overview
    
    Kafka
    
    PostgreSQL query
    PostgreSQL query
    
    Getting started
    
    Setup PostgreSQL
    
    Querying PostgreSQL
    
    Batch S3 ingest
    
    Machine learning
    Machine learning
    
    Fitting model on S3 data
    
    Fitting model on Kafka data
    
    Using language interfaces
  - Querying data
    Querying data
    
    Overview
    
    Purviews
    
    Scope
    
    Late data
    
    Reference data
    
    Routing
    
    Queuing, retries and timeouts
    
    Query methods
    Query methods
    
    REST vs QIPC
    
    SQL
    
    Java interface
    
    PowerBI
    
    Resilience
    
    Logging
    
    Troubleshooting
  - Packaging
    Packaging
    
    Package Overview
    
    Configure package
    
    Create package
    
    Manage deployment components
    
    Manage runtime components
    
    Manage functions within a package
    
    Manage dependent & patch components
    
    Edit components
    
    Upload package
    
    Deploy package
    
    Automated package deployment
    
    Use package
    
    List packages
    
    Download package
    
    Teardown package
    
    Delete package
    
    Pack package
    
    Convert assembly to package
- Administer
  Administer
  - Command line interface
    Command line interface
    
    Overview
    
    Installing the CLI
    
    Configuration
    
    Authentication
    
    Backup and Restore
    
    Reference
  - Entitlements
    Entitlements
    
    Overview
    
    Prerequisites
    
    Configuration
    
    Data Entitlements
    Data Entitlements
    
    Overview
    
    Data Entitlement Quickstart
    
    Row Level Entitlements
    
    Package Entitlements
  - Security and Authentication
    Security and Authentication
    
    User Authentication and Authorization
    User Authentication and Authorization
    
    Overview
    
    Managing Groups
    
    Managing Service Accounts
    
    Managing Users
    
    Encryption of data in transit
    
    Data at rest encryption
    
    Shared Keycloak instance
    
    Keycloak backup and restore
  - Configuration
    Configuration
    
    Overview
    
    Setup
    
    Security
    
    Resources
    
    Availability
    
    Observability
    
    Storage
    
    Database
    
    RT archival
    
    Stream Processor
    
    Advanced
    Advanced
    
    Password policy
    
    Overprovisioning
  - Observability
    Observability
    
    Overview
    
    Logging
    
    Observability Logs
    
    Monitoring
    Monitoring
    
    Overview
    
    Metrics reference
    
    Health
    
    Alerts reference
    
    Dashboard reference
    
    Example stack
- Develop
  Develop
  - REST API
  - Packaging
    Packaging
    
    Package Object Reference
    
    Dependencies
    
    Overlays & Patches
    
    Q API
    
    Python API
    
    Open API
  - Stream Processor
  - Machine Learning
  - Language interfaces
    Language interfaces
    
    Overview
  - Extensions
    Extensions
    
    Visual Studio Code Extension
- Glossary
- Release notes
  Release notes
KDB.AI
PyKX
APIs
APIs
- Overview
- OpenAPI
  OpenAPI
  - Open API
  - q client generation
- Packages
  Packages
  - Overview
  - q Interface
    q Interface
    
    Q API
    
    Packages
    
    User-Defined Functions
  - Python Interface
    Python Interface
    
    Python API
    
    Packages
    
    User Defined Functions
  - Open API
- Database
  Database
  - Overview
  - Interface
    Interface
    
    Overview
    
    Header
    
    Codes
  - Query
    Query
    
    Overview
    
    Get Data
    
    Get Meta
    Get Meta
    
    Get Meta
    
    Get Meta v2
    
    Get Meta v3
    
    Ping
    
    QSQL
    
    SQL
    
    SQL2
    SQL2
    
    SQL2
    
    SQL2 Select Statements
    
    SQL2 Functions and Operators
    
    Preview
  - User Defined Analytics (UDAs)
    User Defined Analytics (UDAs)
    
    User Defined Analytics Overview
    
    How to
    How to
    
    Overview
    
    Creating UDAs
    
    Testing UDAs
    
    Packaging UDAs
    
    Deploying UDAs
    
    Troubleshooting & FAQs
    
    Best Practices
    
    Helper Functions
    
    Codes
    
    Publishing
    
    Example UDAs
  - OpenAPI
    OpenAPI
    
    Overview
    
    Service Gateway
    
    Resource Coordinator
    
    Aggregator
    
    Data Access
    
    Storage Manager
- Reliable Transport
  Reliable Transport
  - Overview
  - APIs
    APIs
    
    Archiver log history
    
    Hard reset
    
    Latest output position
    
    RT clients
    
    Soft reset
  - OpenAPI
    OpenAPI
    
    Worker
- Stream Processor
  Stream Processor
  - Stream Processor
  - Configuring Operators
  - General
  - Lifecycle
  - Operators
  - Readers
  - Decoders
  - Encoders
  - Transform
  - Stats
  - State
  - String Utilities
  - Windows
  - Writers
  - Machine Learning
  - User-Defined Functions
  - Object Reference
    Object Reference
    
    q
    
    Python
  - OpenAPI
    OpenAPI
    
    Coordinator
    
    Controller
    
    Worker
- Streaming
  Streaming
  - Web-sockets
    Web-sockets
    
    Overview
    
    Quickstart
    
    Client protocol
- kdb Insights Python API
  kdb Insights Python API
- Machine Learning
  Machine Learning
  - Machine Learning
  - q Interface
    q Interface
    
    About
    
    Analytics
    Analytics
    
    About
    
    ML Analytics API
    ML Analytics API
    
    Introduction
    
    ML Toolkit
    
    Online Models
    Online Models
    
    Introduction
    
    Stochastic Gradient Descent
    Stochastic Gradient Descent
    
    Stochastic Gradient Descent
    
    Linear Regression
    
    Logistic Classification
    
    Secure Updates
    
    Sequential K Means
    
    Variadic Functionality
    Variadic Functionality
    
    Introduction
    
    Function Calls
    
    Clustering models
    
    Statistical models
    
    Time series models
    
    Online models
    
    Registry
    Registry
    
    About
    
    Cloud Integration
    
    Registry API
    Registry API
    
    Storing
    
    Loading
    
    Deleting
    
    Examples
    Examples
    
    Basic Examples
  - Python Interface
    Python Interface
    
    About
    
    Registry
    Registry
    
    About
    
    Cloud Integration
    
    Registry API
    Registry API
    
    Storing
    
    Loading
    
    Deleting
    
    Examples
    Examples
    
    Basic Python Examples
Accelerators
Accelerators
- kdb Accelerators
- FSI Accelerators
  FSI Accelerators
  - FSI Acclerators Overview
  - ICE Order Book
    ICE Order Book
    
    Order Book Overview
    
    Order Book Quickstart
    
    Ingestion
    
    Order Book Data
    
    Release Notes
  - ICE FI Screener
    ICE FI Screener
    
    ICE Fixed Income FI Screener Overview
    
    ICE Fixed Income Quickstart Guide
    
    ICE Fixed Income Data Ingestion
    
    ICE Fixed Income Historic Data
    
    Backfilling Historical Data
    
    Scheduled Bar Generation configuration
    
    Manual Bar Generation configuration
    
    Release Notes
  - ICE Equities Analytics
    ICE Equities Analytics
    
    ICE Equities Analytics Overview
    
    ICE Equities Analytics Quickstart
    
    ICE Market Data Ingestion
    
    Order Ingestion
    
    Nightly Analytics Generation
    
    getOrderAnalyticSummary API
    
    generateOrderAnalytics API
    
    Adding Custom Order Analytics
    
    Order Analytics Utility Functions
    
    Developing New Order Analytics
    
    Configuring Analytics Using Prevailing Values
    
    Release Notes
  - Bloomberg Equities Analytics
    Bloomberg Equities Analytics
    
    Bloomberg Equities Analytics Overview
    
    Bloomberg Equities Analytics Quickstart
    
    Bloomberg Market Data Ingestion
    
    Order Ingestion
    
    Nightly Analytics Generation
    
    getOrderAnalyticSummary API
    
    generateOrderAnalytics API
    
    Adding Custom Order Analytics
    
    Order Analytics Utility Functions
    
    Developing New Order Analytics
    
    Configuring Analytics Using Prevailing Values
    
    Release Notes
  - Bloomberg BPIPE
    Bloomberg BPIPE
    
    Bloomberg BPIPE Overview
    
    Bloomberg BPIPE Quickstart
    
    Bloomberg BPIPE Feed Install and Customization
    
    Bloomberg EMRS Feed Install and Customization
    
    Bloomberg to Insights Enterprise user map
    
    Bloomberg entitlements filter
    
    Release Notes
  - KX Flow
    KX Flow
    
    KX Flow Overview
    
    KX Flow Quickstart
    
    Ingestion
    
    API Features
    
    Release Notes
- FSI Library
  FSI Library
  - FSI Overview
  - APIs
    APIs
    
    getTicks
    
    getStats
    getStats
    
    Overview
    
    Customize getStats
    
    getBars
    getBars
    
    Overview
    
    Customize getBars data
    
    Asset specific functionality
    Asset specific functionality
    
    Overview
    
    Futures
    
    Cancellations & Corrections
  - API configuration
  - Error & Exceptions Glossary
  - Extending Accelerator APIs
  - Release Notes
- Getting Started
- Deployment
  Deployment
- Ingesting Data
  Ingesting Data
  - Ingesting Data
- ICE Feed Handler
  ICE Feed Handler
- FIX Feed Handler
  FIX Feed Handler
Help

Late Data Best Practices

When setting up your kdb Insights Database for late data, there are things to consider beyond the "how to enable". Here are some common pitfalls and tips to consider when setting up for late data.

1) Ensure that the local DAPs have access to enough memory to store late data.

When late data is enabled, the IDB and HDB will store in-purview data received from the stream in memory until the next EOX event that allows them to purge it and read it from disk when needed. To do this, they will need enough RAM available to keep the data in memory, while still being able to service queries.

An important point to keep in mind when estimating the memory required is whether the system is configured with single mount DAPs or multi mount DAPs, see query configuration for more details. To size appropriately, you need to know the ingestion rate and expectations on how old the data ingested is. The RDB will hold data ingested since the last EOI, the IDB will hold in memory data ingested with timestamps between the last EOD and the last EOI, and HDB will hold data in memory data that has a timestamp older than the last EOD time.

In cases where the ingestion rate is known but the time range of the data is unknown or varies significantly, the multi mount DAP may be easier to size since you can size the whole container that encapsulates all mounts and not worry about which particular DAP the data is in.

2) Set pctMemThreshold such that RDB and IDB can react to an unexpected flood of data.

The pctMemThreshold is a number between 0 and 1, representing how much the DAP should allow table records to occupy its in-memory cache. This pctMemThreshold is converted to a record count maxRecordIntv the DAP expects it can ingest before hitting that cap. When the DAP has ingested maxRecordIntv records within an interval, then an emergency EOI will be triggered to save the process from running out of memory.

3) In a situation where there is a large influx of HDB purview data, manually trigger emergency EODs.

If the amount of late data ingested during the day exceeds the available HDB memory, an early EOD writedown needs to be triggered manually, possibly more than once. If this is not done (or if done too late), the HDB enters low memory mode and will not ingest any additional data until the next reload. When in this state, queries to the HDB return an AC code of .kxi.response.ac.MEMORY, and the ai contains information about the number of records that were ignored while in this low memory state.

4) If setting up an object storage tier, ensure that data is never as late as the data in the object tier.

Currently the Storage Manager does not support the writedown of late data updates to an object storage tier, so any data ingested that is destined for the object tier will be unqueryable after the next EOD.