
PyFlink - datagen connector

📅 Published: 2026/6/18 23:31:29

 

from pyflink.table import TableEnvironment, EnvironmentSettings

settings = EnvironmentSettings.in_streaming_mode()
t_env = TableEnvironment.create(settings)

t_env.execute_sql("""
CREATE TABLE MySource (
    id INT,
    name STRING
) WITH (
    'connector' = 'datagen'
)
""")

t_env.execute_sql("SELECT * FROM MySource").print()

For the above code, where will the table MySource be stored?

 

Short answer:
👉 MySource is not persisted anywhere.
It is a logical (virtual) table backed by a data generator. Its definition is held in the session's default in-memory catalog, and its rows exist only while the Flink job is running.


Why MySource is not stored

1. datagen is a source, not storage

'connector' = 'datagen'
  • datagen generates data on the fly

  • No files, no database, no persistent storage

  • Rows are created in memory at runtime
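
The generator's behaviour can be tuned through connector options. The sketch below builds a bounded variant of the table so the job ends on its own. The table name MyBoundedSource, the helper names, and the concrete values are assumptions made for illustration; the option keys (number-of-rows, rows-per-second, fields.#.kind, fields.#.start, fields.#.end, fields.#.length) are the ones documented for the datagen connector.

```python
def bounded_datagen_ddl(table_name: str = "MyBoundedSource", rows: int = 5) -> str:
    """Return DDL for a datagen table that stops after `rows` rows.

    The table name and the values are illustrative assumptions.
    """
    return f"""
CREATE TABLE {table_name} (
    id INT,
    name STRING
) WITH (
    'connector' = 'datagen',
    'number-of-rows' = '{rows}',
    'rows-per-second' = '1',
    'fields.id.kind' = 'sequence',
    'fields.id.start' = '1',
    'fields.id.end' = '{rows}',
    'fields.name.length' = '8'
)
"""


def run_bounded_demo() -> None:
    """Create the bounded table and print its rows (requires PyFlink)."""
    # Imported lazily so the DDL helper above can be used without PyFlink.
    from pyflink.table import EnvironmentSettings, TableEnvironment

    t_env = TableEnvironment.create(EnvironmentSettings.in_streaming_mode())
    t_env.execute_sql(bounded_datagen_ddl())
    t_env.execute_sql("SELECT * FROM MyBoundedSource").print()
```

Calling run_bounded_demo() in an environment with PyFlink installed should print a handful of rows and then return, instead of streaming until the job is cancelled.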


2. Table API tables are metadata + connectors

In Flink, a table consists of:

  • Schema (columns, types)

  • Connector definition

  • Optional catalog metadata

It does not imply physical storage unless the connector provides it.
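
To see that a table is only metadata plus a connector, the session can be asked what it has registered. This is a minimal sketch: describe_session and show_definition are hypothetical helper names, and t_env is assumed to be the TableEnvironment created in the code at the top. The SHOW CREATE TABLE statement may not be available in older Flink versions.

```python
def describe_session(t_env) -> dict:
    """Report where the session currently registers its tables.

    `t_env` is assumed to be a pyflink.table.TableEnvironment.
    """
    return {
        "catalog": t_env.get_current_catalog(),
        "database": t_env.get_current_database(),
        "tables": list(t_env.list_tables()),
    }


def show_definition(t_env, table_name: str = "MySource") -> None:
    """Print the schema and the DDL that the catalog holds for a table."""
    t_env.execute_sql(f"DESCRIBE {table_name}").print()
    # Availability of SHOW CREATE TABLE depends on the Flink version.
    t_env.execute_sql(f"SHOW CREATE TABLE {table_name}").print()
```

With default settings, describe_session(t_env) would be expected to report default_catalog and default_database, which belong to the in-memory catalog that every new TableEnvironment starts with.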


What actually happens at runtime

SELECT * FROM MySource
  • Flink starts the job

  • datagen produces rows (random or sequential)

  • Rows flow through the pipeline

  • .print() fetches the results back to the client and writes them to stdout

Once the job stops and the script exits:

  • The MySource definition disappears with the session

  • Generated data is lost
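
The same runtime flow can be observed by pulling rows into the client instead of printing them. In the sketch below, take_rows is a hypothetical helper, and table_result is assumed to be the object returned by t_env.execute_sql("SELECT * FROM MySource"). It relies on TableResult.collect(), which returns a closeable iterator that can be used in a with block.

```python
def take_rows(table_result, limit: int = 5) -> list:
    """Read at most `limit` rows from a TableResult, then stop reading.

    `table_result` is assumed to come from t_env.execute_sql("SELECT ...").
    """
    rows = []
    # The `with` block closes the iterator even when we break out early.
    with table_result.collect() as results:
        for row in results:
            rows.append(row)
            if len(rows) >= limit:
                break
    return rows
```

Only the rows copied into the returned list survive. Everything else the generator produced was never written anywhere, which is the behaviour described above.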


Where the table definition lives

Scope            Exists where
Table schema     Session's in-memory catalog (JVM memory)
Generated rows   TaskManager memory
Persistence      ❌ None

Since you didn’t configure:

  • A persistent catalog (for example, Hive)

  • A sink (Kafka, filesystem, DB)

Nothing is saved.


How tables can be stored persistently

1. Use a catalog (for metadata)

t_env.execute_sql("""
CREATE CATALOG my_catalog WITH (
    'type' = 'hive',
    ...
)
""")

2. Use a persistent connector

Examples:

  • Filesystem (HDFS / S3)

  • Kafka

  • JDBC

  • Iceberg / Hudi / Delta

Example sink:

CREATE TABLE MySink (
    id INT,
    name STRING
) WITH (
    'connector' = 'filesystem',
    'path' = '/tmp/output',
    'format' = 'csv'
)
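
Defining the sink stores nothing by itself; rows reach /tmp/output only when a job writes into it. A minimal sketch of that step follows, assuming t_env is an existing TableEnvironment in which the source table is already defined. The helper name persist_rows is hypothetical.

```python
SINK_DDL = """
CREATE TABLE MySink (
    id INT,
    name STRING
) WITH (
    'connector' = 'filesystem',
    'path' = '/tmp/output',
    'format' = 'csv'
)
"""


def persist_rows(t_env, source_table: str = "MySource") -> None:
    """Register MySink and copy rows from `source_table` into it.

    `t_env` is assumed to be a pyflink.table.TableEnvironment in which
    `source_table` already exists.
    """
    t_env.execute_sql(SINK_DDL)
    # INSERT INTO submits a job; wait() blocks until that job finishes.
    t_env.execute_sql(
        f"INSERT INTO MySink SELECT id, name FROM {source_table}"
    ).wait()
```

With the unbounded MySource from this article, wait() would block until the job is cancelled, so a bounded source is the easier way to try this out. In streaming mode the filesystem sink may also leave part files in an in-progress state until a checkpoint completes or the input ends, depending on the Flink version and checkpoint settings.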

Key takeaway

In Flink, tables are logical abstractions.
They are only “stored” if the connector stores data.

Your MySource table:

    • ❌ Not stored on disk

    • ❌ Not stored in a database

    • ✅ Generated dynamically

    • ✅ Exists only during job execution

 

(airflow-venv) frank@ZZHPC:~/zproject$ flink run -py my_job.py
| +I |   549674300 | ac2be66685e79f7152901d0d61a... |
| +I | -1259077681 | 7773378ffd59e62096726aa8b26... |
| +I |  -112218307 | 9f1e979db8abf8cc2e4c1d8310d... |
| +I | -1392104357 | a7dba2cd32fe9c48a759d8a886d... |
| +I | -1516081769 | aa0ca72fb671b46c2b57545cf71... |
| +I |   380579405 | 861ce6e96d244dd6cebd243ec37... |
| +I |  1748819335 | bd162fcbd6c39ad5e47f35055b7... |
| +I | -1648463948 | 37e9643a96a4de8a355928159ea... |
| +I | -1925178175 | 20de30b7135ac186f2461130b11... |
| +I | -1913158356 | c8709a8bbacc2373c24927df314... |
| +I |  -844590625 | c6aae7818d2ddf6f9fd6cdcb836... |
| +I |   551639609 | 2279a5172fcb034030d3182d1fd... |
| +I |   115702922 | 407f6ad6dd3b41b8f65315a9113... |
| +I | -1249309820 | 04fa0efbe2720b23ac369610af2... |
| +I |   307810094 | 4f8211ad2a2daff8142a15ae497... |
| +I |   642408656 | 63e9f697a28324a551124c41f7d... |
| +I |   -26470531 | 0572620b06e7b0e96e722849e85... |
| +I |  -237334693 | 588eb2a24c70a0f0b19f592f904... |
| +I |   680441998 | c2ad60758acc6ea210e54112e02... |
| +I |  1629320224 | d08a9e4e89a873256b0c11f1291... |
......

 
