Version: Next

Virtual Tables

Virtual tables in Fluss are system-generated tables that provide access to metadata, change data, and other system information without storing additional data. They are accessed by appending a suffix (e.g., $changelog) to the base table name.

Fluss supports the following virtual table types:

Virtual Table	Suffix	Description	Supported Tables
Changelog	`$changelog`	Provides access to the raw changelog stream with metadata	Primary Key Tables, Log Tables
Binlog	`$binlog`	Provides binlog format with before/after metadata	Primary Key Tables only

Changelog Table

The $changelog virtual table provides read-only access to the raw changelog stream of a table, allowing you to audit and process all data changes with their associated metadata.

Accessing the Changelog

To access the changelog of a table, append $changelog to the table name:

Flink SQL
SELECT * FROM my_table$changelog;

Schema

The changelog virtual table includes three metadata columns prepended to the original table columns:

Column	Type	Description
`_change_type`	STRING NOT NULL	The type of change operation (see Change Types)
`_log_offset`	BIGINT NOT NULL	The offset position in the log
`_commit_timestamp`	TIMESTAMP_LTZ(3) NOT NULL	The timestamp when the change was committed

Followed by all columns from the base table.

Change Types

The _change_type column indicates the type of data modification:

Primary Key Tables

For Primary Key Tables, the following change types are supported:

Change Type	Description
`insert`	A new row was inserted
`update_before`	The previous value of an updated row (retraction)
`update_after`	The new value of an updated row
`delete`	A row was deleted

Log Tables

For Log Tables (append-only), only one change type is used:

Change Type	Description
`insert`	A new row was inserted into the log

Examples

Consider a Primary Key Table tracking user orders:

Flink SQL
-- Create a primary key table
CREATE TABLE orders (
    order_id INT NOT NULL,
    customer_name STRING,
    amount DECIMAL(10, 2),
    PRIMARY KEY (order_id) NOT ENFORCED
) WITH ('bucket.num' = '1');

-- Insert a record
INSERT INTO orders VALUES (1, 'Rhea', 100.00);

-- Update the record
INSERT INTO orders VALUES (1, 'Rhea', 150.00);

-- Delete the record
DELETE FROM orders WHERE order_id = 1;

-- Query the changelog
SELECT * FROM orders$changelog;

Output:

+---------------+-------------+---------------------+----------+---------------+---------+
| _change_type  | _log_offset | _commit_timestamp   | order_id | customer_name | amount  |
+---------------+-------------+---------------------+----------+---------------+---------+
| insert        |           0 | 2024-01-15 10:30:00 |        1 | Rhea          |  100.00 |
| update_before |           1 | 2024-01-15 10:35:00 |        1 | Rhea          |  100.00 |
| update_after  |           2 | 2024-01-15 10:35:00 |        1 | Rhea          |  150.00 |
| delete        |           3 | 2024-01-15 10:40:00 |        1 | Rhea          |  150.00 |
+---------------+-------------+---------------------+----------+---------------+---------+

Startup Modes

Mode	Description
`earliest`	Start reading from the beginning of the log
`latest`	Start reading from the current end of the log (only new changes)
`timestamp`	Start reading from a specific timestamp (milliseconds since epoch)

The changelog table supports different startup modes to control where reading begins:

Flink SQL
-- Read from the beginning (default)
SELECT * FROM orders$changelog /*+ OPTIONS('scan.startup.mode' = 'earliest') */;

-- Read only new changes from now
SELECT * FROM orders$changelog /*+ OPTIONS('scan.startup.mode' = 'latest') */;

-- Read from a specific timestamp
SELECT * FROM orders$changelog /*+ OPTIONS('scan.startup.mode' = 'timestamp', 'scan.startup.timestamp' = '1705312200000') */;

Limitations

Projection, partition, and predicate pushdowns are not supported yet. This will be addressed in future releases.

Binlog Table

The $binlog virtual table provides access to change data where each record contains both the before and after images of the row. This is useful for:

note

The $binlog virtual table is only available for Primary Key Tables.

Accessing the Binlog

To access the binlog of a Primary Key Table, append $binlog to the table name:

Flink SQL
SELECT * FROM my_pk_table$binlog;

Schema

The binlog virtual table includes three metadata columns followed by nested before and after row structures:

Column	Type	Description
`_change_type`	STRING NOT NULL	The type of change operation: `insert`, `update`, or `delete`
`_log_offset`	BIGINT NOT NULL	The offset position in the log
`_commit_timestamp`	TIMESTAMP_LTZ(3) NOT NULL	The timestamp when the change was committed
`before`	ROW<...>	The row values before the change (NULL for inserts)
`after`	ROW<...>	The row values after the change (NULL for deletes)

The before and after columns are nested ROW types containing all columns from the base table.

Change Types

Change Type	Description	`before`	`after`
`insert`	A new row was inserted	NULL	Contains new row values
`update`	A row was updated	Contains old row values	Contains new row values
`delete`	A row was deleted	Contains deleted row values	NULL

Examples

Flink SQL
-- Create a primary key table
CREATE TABLE users (
    user_id INT NOT NULL,
    name STRING,
    email STRING,
    PRIMARY KEY (user_id) NOT ENFORCED
) WITH ('bucket.num' = '1');

-- Insert, update, then delete a record
INSERT INTO users VALUES (1, 'Alice', 'alice@example.com');
INSERT INTO users VALUES (1, 'Alice Smith', 'alice.smith@example.com');
DELETE FROM users WHERE user_id = 1;

-- Query the binlog
SELECT * FROM users$binlog;

Output:

+--------------+-------------+---------------------+----------------------------------+--------------------------------------+
| _change_type | _log_offset | _commit_timestamp   | before                           | after                                |
+--------------+-------------+---------------------+----------------------------------+--------------------------------------+
| insert       |           0 | 2024-01-15 10:30:00 | NULL                             | (1, Alice, alice@example.com)        |
| update       |           2 | 2024-01-15 10:35:00 | (1, Alice, alice@example.com)    | (1, Alice Smith, alice.smith@example.com) |
| delete       |           3 | 2024-01-15 10:40:00 | (1, Alice Smith, alice.smith@example.com) | NULL                        |
+--------------+-------------+---------------------+----------------------------------+--------------------------------------+

Accessing Nested Fields

You can access individual fields from the before and after structures:

Flink SQL
SELECT
    _change_type,
    _commit_timestamp,
    `before`.name AS old_name,
    `after`.name AS new_name
FROM users$binlog
WHERE _change_type = 'update';

Startup Modes

Mode	Description
`earliest`	Start reading from the beginning of the log
`latest`	Start reading from the current end of the log (only new changes)
`timestamp`	Start reading from a specific timestamp (milliseconds since epoch)

The binlog table supports different startup modes to control where reading begins:

Flink SQL
-- Read from the beginning (default)
SELECT * FROM orders$binlog /*+ OPTIONS('scan.startup.mode' = 'earliest') */;

-- Read only new changes from now
SELECT * FROM orders$binlog /*+ OPTIONS('scan.startup.mode' = 'latest') */;

-- Read from a specific timestamp
SELECT * FROM orders$binlog /*+ OPTIONS('scan.startup.mode' = 'timestamp', 'scan.startup.timestamp' = '1705312200000') */;

Limitations

Projection, partition, and predicate pushdowns are not supported yet. This will be addressed in future releases.

Changelog Table​

Accessing the Changelog​

Schema​

Change Types​

Primary Key Tables​

Log Tables​

Examples​

Startup Modes​

Limitations​

Binlog Table​

Accessing the Binlog​

Schema​

Change Types​

Examples​

Accessing Nested Fields​

Startup Modes​

Limitations​

Changelog Table

Accessing the Changelog

Schema

Change Types

Primary Key Tables

Log Tables

Examples

Startup Modes

Limitations

Binlog Table

Accessing the Binlog

Schema

Change Types

Examples

Accessing Nested Fields

Startup Modes

Limitations