Stream Processor q API

.qsp.

Configuring operators
  use  modify behavior of an operator

General
  push                      publish data to all downstream operators
  run                       install and run a pipeline
  teardown                  tear down a pipeline
  configPath                return the path of mounted configuration files
  getPartitions             return the current assigned partitions
  getPartitionCount         return the count of all partitions
  setTrace                  enables trace logging
  clearTrace                clears trace logging and resets logging level
  enableDataTracing         captures data flowing in a pipeline
  disableDataTracing        disables data tracing in a pipeline
  resetDataTrace            resets the current trace data cache
  clearDataTrace            resets the current trace data cache (deprecated)
  getDataTrace              returns a point-in-time data trace capture
  setRecordCounting (Beta)  sets the level for tracking dataflow in a pipeline
  resetRecordCounts (Beta)  resets the current record counts cache
  getRecordCounts (Beta)    returns information on the amount of dataflow

Lifecycle
  finish                    call finish on an operator
  finishTask                mark a task as finished
  onError                   set the onError event handler
  onCheckpoint              set the onCheckpoint handler
  onOperatorCheckpoint      set the onCheckpoint event handler
  onOperatorPostCheckpoint  set the onPostCheckpoint event handler
  onOperatorRecover         set the onRecover event handler
  onPostCheckpoint          set the onPostCheckpoint handler
  onRecover                 set the onRecover handler
  onSetup                   set the onSetup event handler
  onStart                   set the onStart event handler
  onFinish                  set the onFinish event handler
  onTeardown                set the onTeardown event handler
  registerTask              register a task for an operator
  subscribe                 add a subscriber for an event
  unsubscribe               remove a subscriber or all subscribers

Operators
  accumulate       aggregates a stream into an accumulator
  apply            apply a function to incoming batches in the stream
  filter           filter some or all elements from a batch
  keyBy (Beta)     keys a stream on a value in the stream
  map              apply a function to data passing through the operator
  merge            merge two data streams
  parallel (Beta)  applies multiple functions in parallel over a stream
  reduce           aggregate partial windows
  rolling (Beta)   applies a moving-window function to a stream
  split            split the current stream
  sql              execute an SQL query on tables in a stream
  union            unite two streams

Readers
  read.fromAmazonS3        reads data from Amazon Web Services S3 buckets
  read.fromAzureStorage    reads data from Azure Blob Storage
  read.fromCallback        define callback in the q global namespace
  read.fromExpr            evaluate expression or function into the pipeline
  read.fromFile            read file contents into pipeline
  read.fromKafka           consume data from a Kafka topic
  read.fromGoogleStorage   reads data from Google Cloud Storage
  read.fromPostgres        execute a query against a PostgreSQL database
  read.fromSQLServer       execute a query against a SQL Server database
  read.fromStream          read data using a KX Insights stream
  read.fromParquet (Beta)  read Apache Parquet data from a cloud registry

Decoders
  decode.csv           parse CSV data to a table
  decode.arrow (Beta)  decode Arrow streams
  decode.gzip (Beta)   decode gzipped data
  decode.json          parse JSON data
  decode.protobuf      parse Protocol Buffer messages

Encoders
  encode.arrow (Beta)  encode a stream as Arrow data
  encode.csv           encode tables as CSV data
  encode.json          encode data in JSON format
  encode.protobuf      encode Protocol Buffer messages

State
  get  get cached state of an operator
  set  store state of an operator

Stats (Beta)
  sma  calculate a simple moving average
  ema  calculate an exponential moving average
  twa  calculate a time weighted average

String
  string.toUpperCase  uppercases specified incoming data
  string.toLowerCase  lowercases specified incoming data

Transform
  transform.renameColumns    renames columns in the incoming data
  transform.replaceInfinity  replaces infinite values with min/max values
  transform.replaceNull      replaces null values with the median value
  transform.timeSplit        decomposes time columns into subdivisions of mins/hours etc.
  transform.fill (Beta)      fill in null values in a table
  transform.schema           transforms data to match a provided schema

Windows
  window.count     aggregate stream into evenly sized windows
  window.global    aggregate stream using a custom trigger function
  window.sliding   aggregate stream in potentially overlapping windows
  window.timer     aggregate stream by processing time
  window.tumbling  aggregate stream into non-overlapping windows

Writers
  write.toAmazonS3    write to an object in Amazon S3
  write.toConsole     write to the console
  write.toDatabase    write to KX Insights Database
  write.toKafka       publish data on a Kafka topic
  write.toKDB (Beta)  write data to an on-disk partitioned table
  write.toProcess     write data to a kdb+ process
  write.toStream      write data using a KX Insights stream
  write.toVariable    write to a local variable

Machine Learning

Fresh
  ml.freshCreate  turns batches of data into features based on aggregated statistics

Classification
  ml.adaBoostClassifier             fits an adaBoost classification model
  ml.decisionTreeClassifier         fits a decision tree classification model
  ml.gaussianNB                     fits a gaussian naive bayes model
  ml.kNeighborsClassifier           fits a k-nearest neighbors classification model
  ml.logClassifier                  fits a logistic classification model using stochastic gradient descent
  ml.quadraticDiscriminantAnalysis  fits a quadratic discriminant analysis model
  ml.randomForestClassifier         fits a random forest classification model

Clustering
  ml.affinityPropagation  fits an affinity propagation clustering model
  ml.birch                fits a BIRCH clustering model
  ml.cure                 fits a CURE clustering model
  ml.dbscan               fits a DBSCAN clustering model
  ml.sequentialKMeans     fits a sequential k-means model

Regression
  ml.adaBoostRegressor          fits an adaBoost regression model
  ml.gradientBoostingRegressor  fits a gradient boosting regression model
  ml.kNeighborsRegressor        fits a k-nearest neighbors regression model
  ml.lasso                      fits a lasso-linear regression model
  ml.linearRegression           fits a linear regression model
  ml.randomForestRegressor      fits a random forest regression model

Metrics
  ml.score  evaluates a model's predictions

Preprocessing
  ml.dropConstant   drops constant columns from incoming data
  ml.featureHasher  encodes categorical data as numeric vectors
  ml.labelEncode    encodes symbolic data into numerical values
  ml.minMaxScaler   min-max scale a supplied dataset
  ml.oneHot         replaces symbolic values with numerical vector representations
  ml.standardize    standardizes a supplied dataset

Registry
  ml.registry.fit      fits a model to batches of data, saving a model to a registry
  ml.registry.predict  predicts a target variable using a trained model from the registry
  ml.registry.update   trains a model incrementally, returning predictions for all records

Operator syntax

Pipeline API operators are designed to be chained, as in the examples, in a form that will be familiar to users of libraries such as jQuery.

APIs are executed from left to right (or top to bottom when newlines are added) and are designed to be composed for human readability. For example, in the following pipeline, data is read from Kafka, passed through a JSON decoder, windowed, and written to an Insights stream.

.qsp.run
    .qsp.read.fromKafka[`trades]
    .qsp.decode.json[]
    .qsp.window.tumbling[00:00:05; `time]
    .qsp.write.toStream[]

Implicit last argument

Each .qsp operator returns an object representing a ‘node’ configuration, and takes a node configuration as its last argument. That last argument is left implicit in the API documentation, so each operator has a higher rank than documented.

Pipeline API operators are invoked as projections on their implicit last arguments and therefore must be applied using bracket notation only, never prefix.
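
The projection behavior can be sketched in plain q without the Stream Processor. This is only an illustration under stated assumptions: foo is a hypothetical operator (not part of .qsp), and a node is modelled as a plain list rather than the real node configuration object.

```q
/ Hypothetical operator, not part of .qsp: documented as foo[x;y], true rank 3.
/ A node is modelled here as a plain list describing the downstream chain.
foo:{[x;y;node] (`foo;x;y),enlist node}

p:foo[1;2]              / bracket application with the node omitted
type p                  / 104h: a projection awaiting the node

/ Juxtaposition hands each projection the node built to its right
foo[1;2] foo[3;4] `end  / (`foo;1;2;(`foo;3;4;`end))
```

Prefix application supplies a single argument, so it cannot express foo[x;y] while leaving the node open; bracket notation can.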

Implicit penultimate argument

Many Pipeline API operators are variadic; most take a penultimate argument of custom configuration options.

If .qsp.foo is a Pipeline API operator that takes arguments x and y and, optionally, a dictionary cco of custom configuration options, its true signatures are:

.qsp.foo[x;y;node]
.qsp.foo[x;y;cco;node]

To limit duplication in the documentation, neither the cco nor the node arguments are documented. The signature would be documented as

.qsp.foo[x;y]

An important consequence is that

.qsp.foo[x;y]
.qsp.foo[x;y;cco]

are both unary projections.
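
A rough model of the two forms in plain q is shown below. q lambdas have fixed rank, so this sketch uses two hypothetical functions, foo3 and foo4, to stand in for the two true signatures; neither they nor the name key in cco come from .qsp, and a node is again modelled as a plain list.

```q
/ Hypothetical stand-ins for the two true signatures of a documented foo[x;y].
/ Neither function is part of .qsp; a node is modelled as a plain list.
foo3:{[x;y;node] (`foo;x;y;()),enlist node}        / foo[x;y;node]
foo4:{[x;y;cco;node] (`foo;x;y;cco),enlist node}   / foo[x;y;cco;node]

cco:enlist[`name]!enlist `myFoo    / hypothetical custom configuration options

p3:foo3[1;2]        / documented form without options
p4:foo4[1;2;cco]    / documented form with options
type each (p3;p4)   / both 104h: projections awaiting one argument, the node

p3 `downstream      / each accepts the node as its single remaining argument
p4 `downstream
```

Either projection can therefore sit at the same position in a chain, whether or not options were supplied.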

Order of evaluation

Successive Pipeline API operators modify the node configuration and pass it along the chain for eventual evaluation, reversing the apparent evaluation order.

In

prd
  {x where x>5}
  til 8

q evaluates first til 8, then the lambda, then prd, but in

.qsp.run
  .qsp.read.fromExpr["refData"]
  .qsp.write.toConsole[]

because actual evaluation is handled by .qsp.run, the table refData is read before its contents are written to the console.
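
The contrast can be reproduced in plain q with a small model of deferred evaluation. This is a sketch under stated assumptions, not the .qsp implementation: op and run are hypothetical names, a node is modelled as a list of functions, and the chain is terminated with an explicit empty node, which .qsp does not require.

```q
/ Ordinary q: til 8 is evaluated first, then the lambda, then prd
prd {x where x>5} til 8          / 42

/ Hypothetical model: each operator only prepends its step to the node it receives
op:{[f;node] enlist[f],node}

/ run performs the actual work, feeding each step's result to the next step
run:{[steps] {y x}/[();steps]}

/ q still builds the chain right to left, but run executes the steps top to bottom
run
  op[{[x] til 8}]
  op[{x where x>5}]
  op[prd]
  ()
```

The final expression also returns 42, but here the step that produces the data runs first because nothing is computed until run walks the assembled list.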