Stream Processor q API

All functions in this API reside in the .qsp namespace.

Configuring operators
    use    modify behavior of an operator

General
    push                        publish data to all downstream operators
    run                         install and run a pipeline
    teardown                    tear down a pipeline
    configPath                  return the path of mounted configuration files
    getPartitions               return the current assigned partitions
    getPartitionCount           return the count of all partitions
    setTrace                    enables trace logging
    clearTrace                  clears trace logging and resets logging level
    enableDataTracing           captures data flowing in a pipeline
    disableDataTracing          disables data tracing in a pipeline
    resetDataTrace              resets the current trace data cache
    clearDataTrace              resets the current trace data cache (deprecated)
    getDataTrace                returns a point-in-time data trace capture
    setRecordCounting (Beta)    sets the level for tracking dataflow in a pipeline
    resetRecordCounts (Beta)    resets the current record counts cache
    getRecordCounts (Beta)      returns information on the amount of dataflow

Lifecycle
    finish                      call finish on an operator
    finishTask                  mark a task as finished
    onError                     set the onError event handler
    onCheckpoint                set the onCheckpoint handler
    onOperatorCheckpoint        set the onCheckpoint event handler
    onOperatorPostCheckpoint    set the onPostCheckpoint event handler
    onOperatorRecover           set the onRecover event handler
    onPostCheckpoint            set the onPostCheckpoint handler
    onRecover                   set the onRecover handler
    onSetup                     set the onSetup event handler
    onStart                     set the onStart event handler
    onFinish                    set the onFinish event handler
    onTeardown                  set the onTeardown event handler
    registerTask                register a task for an operator
    subscribe                   add a subscriber for an event
    unsubscribe                 remove a subscriber or all subscribers

Operators
    accumulate         aggregates a stream into an accumulator
    apply              apply a function to incoming batches in the stream
    filter             filter some or all elements from a batch
    keyBy (Beta)       keys a stream on a value in the stream
    map                apply a function to data passing through the operator
    merge              merge two data streams
    parallel (Beta)    applies multiple functions in parallel over a stream
    reduce             aggregate partial windows
    rolling (Beta)     applies a moving-window function to a stream
    split              split the current stream
    sql                execute an SQL query on tables in a stream
    union              unite two streams

Readers
    read.fromAmazonS3         reads data from Amazon Web Services S3 buckets
    read.fromAzureStorage     reads data from Azure Blob Storage
    read.fromCallback         define a callback in the q global namespace
    read.fromDatabase         reads data from an Insights database
    read.fromExpr             evaluate an expression or function into the pipeline
    read.fromFile             read file contents into the pipeline
    read.fromGoogleStorage    reads data from Google Cloud Storage
    read.fromHTTP             requests data from an HTTP(S) endpoint
    read.fromKafka            consume data from a Kafka topic
    read.fromMQTT             subscribe to an MQTT topic
    read.fromParquet          read Apache Parquet data from a cloud registry
    read.fromPostgres         execute a query against a PostgreSQL database
    read.fromSQLServer        execute a query against a SQL Server database
    read.fromStream           read data using a kdb Insights stream
    read.fromUpload           read data supplied through an HTTP endpoint

Decoders
    decode.arrow (Beta)    decode Arrow streams
    decode.csv             parse CSV data to a table
    decode.gzip (Beta)     decode gzipped data
    decode.json            parse JSON data
    decode.pcap            decode pcap data
    decode.protobuf        parse Protocol Buffer messages

Encoders
    encode.arrow (Beta)    encode a stream as Arrow data
    encode.csv             encode tables as CSV data
    encode.json            encode data in JSON format
    encode.protobuf        encode Protocol Buffer messages

State
    get    cached state of an operator
    set    store state of an operator

Stats (Beta)
    describe    calculate specific statistics
    ema         calculate an exponential moving average
    sma         calculate a simple moving average
    twa         calculate a time-weighted average

String
    string.toUpperCase    uppercases specified incoming data
    string.toLowerCase    lowercases specified incoming data

Transform
    transform.fill (Beta)        fill in null values in a table
    transform.renameColumns      renames columns in the incoming data
    transform.replaceInfinity    replaces infinite values with min/max values
    transform.replaceNull        replaces null values with the median value
    transform.schema             transforms data to match a provided schema
    transform.timeSplit          decomposes time columns into subdivisions such as minutes and hours

Windows
    window.count       aggregate stream into evenly sized windows
    window.global      aggregate stream using a custom trigger function
    window.sliding     aggregate stream in potentially overlapping windows
    window.timer       aggregate stream by processing time
    window.tumbling    aggregate stream into non-overlapping windows

Writers
    write.toAmazonS3      write to an object in Amazon S3
    write.toConsole       write to the console
    write.toDatabase      write to the kdb Insights Database
    write.toKafka         publish data on a Kafka topic
    write.toKDB (Beta)    write data to an on-disk partitioned table
    write.toProcess       write data to a kdb+ process
    write.toStream        write data using a kdb Insights stream
    write.toSubscriber    write data to subscribers
    write.toVariable      write to a local variable

Machine Learning

Fresh
    ml.freshCreate    turns batches of data into features based on aggregated statistics

Classification
    ml.adaBoostClassifier               fits an AdaBoost classification model
    ml.decisionTreeClassifier           fits a decision tree classification model
    ml.gaussianNB                       fits a Gaussian naive Bayes model
    ml.kNeighborsClassifier             fits a k-nearest neighbors classification model
    ml.logClassifier                    fits a logistic classification model using stochastic gradient descent
    ml.quadraticDiscriminantAnalysis    fits a quadratic discriminant analysis model
    ml.randomForestClassifier           fits a random forest classification model

Clustering
    ml.affinityPropagation    fits an affinity propagation clustering model
    ml.birch                  fits a BIRCH clustering model
    ml.cure                   fits a CURE clustering model
    ml.dbscan                 fits a DBSCAN clustering model
    ml.sequentialKMeans       fits a sequential k-means model

Regression
    ml.adaBoostRegressor            fits an AdaBoost regression model
    ml.gradientBoostingRegressor    fits a gradient boosting regression model
    ml.kNeighborsRegressor          fits a k-nearest neighbors regression model
    ml.lasso                        fits a lasso linear regression model
    ml.linearRegression             fits a linear regression model
    ml.randomForestRegressor        fits a random forest regression model

Metrics
    ml.score    evaluates a model's predictions

Preprocessing
    ml.dropConstant     drops constant columns from incoming data
    ml.featureHasher    encodes categorical data as numeric vectors
    ml.labelEncode      encodes symbolic data into numerical values
    ml.minMaxScaler     min-max scales a supplied dataset
    ml.oneHot           replaces symbolic values with numerical vector representations
    ml.standardize      standardizes a supplied dataset

Registry
    ml.registry.fit        fits a model to batches of data, saving a model to a registry
    ml.registry.predict    predicts a target variable using a trained model from the registry
    ml.registry.update     trains a model incrementally, returning predictions for all records

Operator syntax

Pipeline API operators are designed to be chained, as in the examples, in a form that will be familiar to users of libraries such as jQuery.

Operators are executed from left to right (or top to bottom when newlines are added) and are designed to compose for human readability. For example, in the following pipeline, data is read from Kafka, decoded from JSON, windowed, and written to an Insights stream.

.qsp.run
    .qsp.read.fromKafka[`trades]
    .qsp.decode.json[]
    .qsp.window.tumbling[00:00:05; `time]
    .qsp.write.toStream[]

Implicit last argument

Each .qsp operator returns an object representing a ‘node’ configuration, and takes a node configuration as its last argument. That last argument is left implicit in the API documentation: each operator therefore has a rank one higher than documented.

Pipeline API operators are invoked as projections on their implicit last arguments and therefore must be applied using bracket notation only, never prefix.
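
For example, with the map operator (a minimal sketch, assuming the Stream Processor library is loaded):

/ bracket notation leaves the implicit node argument open:
/ this returns a unary projection, not yet a running pipeline node
op: .qsp.map[{x + 1}]

/ chaining supplies each operator's node configuration implicitly
.qsp.run
    .qsp.read.fromExpr["([] x: til 5)"]
    .qsp.map[{x + 1}]
    .qsp.write.toConsole[]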

Implicit penultimate argument

Many Pipeline API operators are variadic; most optionally take a penultimate argument of custom configuration options.

If .qsp.foo is a Pipeline API operator that takes arguments x and y, and optionally cco, a dictionary of custom configuration options, its true signatures are

.qsp.foo[x;y;node]
.qsp.foo[x;y;cco;node]

To limit duplication in the documentation, neither the cco nor the node argument is documented. The signature would be documented as

.qsp.foo[x;y]

An important consequence is that

.qsp.foo[x;y]
.qsp.foo[x;y;cco]

are both unary projections.
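
Concretely, for the real operator .qsp.window.tumbling (a sketch: it assumes .qsp.use wraps a dictionary of options, per the Configuring operators entry above, and the `name option shown is illustrative):

/ both expressions are unary projections awaiting the implicit node argument
p1: .qsp.window.tumbling[00:00:05; `time]
/ illustrative options dictionary; the `name option is assumed here
p2: .qsp.window.tumbling[00:00:05; `time; .qsp.use enlist[`name]!enlist `win5s]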

Order of evaluation

Successive Pipeline API operators modify the node configuration and pass it along the chain for eventual evaluation, reversing the apparent evaluation order.

In

prd
  {x where x>5}
  til 8

q evaluates til 8 first, then the lambda, then prd; but in

.qsp.run
  .qsp.read.fromExpr["refData"]
  .qsp.write.toConsole[]

because actual evaluation is handled by .qsp.run, the table refData is read before its contents are written to the console.
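
A self-contained sketch of the same pipeline, defining refData locally (again assuming the Stream Processor library is loaded):

refData: ([] id: til 3; px: 10 20 30f)

/ .qsp.run evaluates the chain top to bottom:
/ refData is read first, then its contents are written to the console
.qsp.run
    .qsp.read.fromExpr["refData"]
    .qsp.write.toConsole[]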