922a57fe34
Self-hosted 開源:WASM 零件 + recipe + cypher-executor,跑在你自己的 Cloudflare。 此為重建的乾淨歷史起點(移除曾誤 commit 的 GCP SA 金鑰,舊歷史保留在 richblack/arcrun 與本地 backup 分支)。含: - acr init --self-hosted installer(建 KV/R2 + codeload 拉預編譯 wasm + wrangler deploy + seed recipe) - recipe push 把關(資料外流提醒 + 打通檢查) - 19 個正當零件預編譯 wasm(claude_api/km_writer/kbdb_upsert_block 排除:違反 DECISIONS §1) - CLI / cypher-executor / registry / 完整 SDD Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
26 lines
748 B
YAML
26 lines
748 B
YAML
name: pdf_to_blocks
|
|
description: 收 PDF URL → 轉文字 → 切 chunk → 存 KBDB 每塊一個 block
|
|
|
|
flow:
|
|
- "input >> ON_SUCCESS >> convert_pdf"
|
|
- "convert_pdf >> ON_SUCCESS >> ingest_to_kbdb"
|
|
|
|
config:
|
|
convert_pdf:
|
|
component: http_request
|
|
url: "https://kbdb.finally.click/convert"
|
|
method: POST
|
|
body_json:
|
|
file_url: "{{input.pdf_url}}"
|
|
format: "text"
|
|
|
|
# kbdb_ingest 自動 chunk + 寫 blocks(每塊 ~500 字)
|
|
# source 用 file_url 當去重 key(同 PDF 重 ingest 不會重複建)
|
|
ingest_to_kbdb:
|
|
component: kbdb_ingest
|
|
api_key: "{{api_key}}"
|
|
page_name: "pdf-{{input.title}}"
|
|
text: "{{convert_pdf.data.text}}"
|
|
source: "pdf:{{input.pdf_url}}"
|
|
user_id: "{{input.user_id}}"
|