当前位置：首页 > news >正文

Monorepo 增量构建：哈希指纹与缓存实践

news 2026/6/16 8:10:46

Monorepo 增量构建：哈希指纹与缓存实践

在 Monorepo 里放太多项目，构建时间确实会成倍增长。改一行样式代码，CI 要把所有子项目重新编译一遍，这谁受得了。

一、问题在哪

全量构建的浪费主要来自两点：

无差别重编译。只改了 App A 的样式，构建系统却把 App B 甚至后端子包也重新跑了一遍。这些子项目和本次变更毫无关系，但 CI 不管，照跑不误。

本地和 CI 各算各的。开发者本地测试已经通过了，推送到 CI 后又是一整套完整流程。本地缓存没法复用，CI 白白消耗算力。

核心思路其实很简单：给每个构建任务算一个输入哈希。如果输入没变，就直接用之前的输出，跳过编译。

二、哈希怎么算

流程分三步：

收集任务的所有输入：源文件内容、环境变量、依赖版本
用 SHA-256 生成一个 Input Hash
查缓存仓库有没有这个 Hash 对应的产物。有就下载解压，没有就正常编译并把结果存进去

sequenceDiagram autonumber actor Dev as 开发人员 / CI 节点 participant Engine as 任务编排引擎 participant FS as 本地文件系统 participant CacheStore as 缓存仓储 Dev->>Engine: 执行构建命令 activate Engine Engine->>FS: 递归扫描子项目源文件 FS-->>Engine: 返回文件列表与修改时间 Engine->>Engine: 计算 SHA-256 复合哈希 Engine->>CacheStore: 核对该 Hash 是否有缓存 activate CacheStore alt 缓存命中 CacheStore-->>Engine: 返回编译产物 (.tar.gz) Engine->>FS: 解压覆盖 dist/ 目录 Engine-->>Dev: 构建完成 (缓存命中) else 缓存未命中 CacheStore-->>Engine: 无缓存 deactivate CacheStore Engine->>Engine: 启动编译器执行编译 Engine->>FS: 写入编译产物到 dist/ Engine->>CacheStore: 打包 dist/ 并上传，绑定 Input Hash Engine-->>Dev: 编译完成，生成缓存备份 end deactivate Engine

三、代码实现

下面是一个简单的文件指纹扫描器，用 Node.js 写的，递归遍历目录并计算 SHA-256：

const fs = require('fs'); const path = require('path'); const crypto = require('crypto'); class FileFingerprinter { constructor(ignorePatterns = []) { this.ignorePatterns = [ 'node_modules', '.git', 'dist', '.DS_Store', ...ignorePatterns ]; } isIgnored(filePath) { return this.ignorePatterns.some(pattern => filePath.includes(pattern)); } getAllFiles(dir, fileList = []) { const files = fs.readdirSync(dir); files.forEach(file => { const fullPath = path.join(dir, file); if (this.isIgnored(fullPath)) return; if (fs.statSync(fullPath).isDirectory()) { this.getAllFiles(fullPath, fileList); } else { fileList.push(fullPath); } }); return fileList; } calculateDirectoryHash(dirPath) { const files = this.getAllFiles(dirPath).sort(); const hash = crypto.createHash('sha256'); files.forEach(filePath => { try { const content = fs.readFileSync(filePath); // 文件名和内容一起参与哈希，确保文件改名也能被感知 hash.update(path.relative(dirPath, filePath)); hash.update(content); } catch (err) { console.error(`读文件失败 ${filePath}:`, err.message); } }); return hash.digest('hex'); } } // 测试 const printer = new FileFingerprinter(); const mockProjectPath = path.resolve('./src'); if (fs.existsSync(mockProjectPath)) { const hash = printer.calculateDirectoryHash(mockProjectPath); console.log("指纹:", hash); }

几个注意点：