PodCastLM: Turn a PDF or URL into Podcast Dialogue and Audio in One Click

A NotebookLM-style podcast production pipeline: give it a PDF or a URL, and it outputs a two-person dialogue, a summary, show notes, and an MP3.

Open-source repository: https://github.com/ylouis83/Podcastllm

Highlights

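  • One-click generation: podcast script + summary + show notes, streamed back and rendered in real time by the frontend.
  • Multiple TTS backends: FishAudio, Azure Speech, and Qwen3 TTS, with switchable voices for host and guest.
  • Local transcription: built-in Whisper with local inference; supports web upload or batch transcription from the CLI, with timestamp export.
  • Flexible input: upload a PDF or fetch a URL with Jina.
  • Low-latency streaming: dialogue generation, summ[……]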

Memory-X: A Temporal Memory System for AI Agents

Introduction

One of the biggest limitations of modern AI agents is memory. Most LLM-based agents can handle short-term context within a conversation, but they struggle with long-term memory: tracking facts, adapting to user preferences, and maintaining consistency across sessions.

Check ou[……]

Digital Human Technology Trends

Recently, we explored some trends in digital human technology. Across both the traditional Internet and vertical fields (including healthcare and social networking), digital human technology is being applied more and more widely.

Read this PDF:[……]

[First Look] AIGC and ChatGPT

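ChatGPT's "iPhone moment" has arrived. As more and more model capabilities are released, AIGC use cases are becoming richer and clearer, and more and more industry leaders are getting personally involved. The investment community is about to see a historic technological shift, with every party hoping to claim a slice of this enormous cake. Today we take a quick taste of some functional applications...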

OpenAI has just announced plugin support, giving ChatGPT the ability to use tools, access the internet, and run computations.  Chatgpt-plugin

1. Yesterday the organization opened up its first batch of ChatGPT plugins. These plugins come from Expedia, FiscalNote, Instacart, KAYAK, Klarna, Milo, OpenTable, Shopify, Slack, Speak, Wolf[……]

Presto-engine Privileges Control – Architect

We use Presto as our main query engine to provide data query services to business teams. As a core capability in our architecture, Presto needs permission control on data queries, including permissions at the table and column levels. We designed the following scheme to meet this requirement.

[……]
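
To make "table- and column-level permission control" concrete, here is a minimal Python sketch of the kind of check such a scheme has to perform before a query runs. It is not the design from the post, which is not shown in this excerpt, and it does not use any Presto API: `AccessPolicy`, `AccessDenied`, `grant`, and `check` are hypothetical names chosen for illustration.

```python
class AccessDenied(Exception):
    """Raised when a user requests a table or column they were not granted."""


ALL_COLUMNS = "*"  # hypothetical wildcard meaning "every column in the table"


class AccessPolicy:
    """In-memory grants keyed by (user, table); purely illustrative."""

    def __init__(self):
        self._grants = {}  # (user, table) -> set of allowed column names

    def grant(self, user, table, columns=(ALL_COLUMNS,)):
        # Granting with no explicit columns allows the whole table.
        self._grants.setdefault((user, table), set()).update(columns)

    def check(self, user, table, columns):
        allowed = self._grants.get((user, table))
        if allowed is None:
            # Table-level check: no grant at all for this table.
            raise AccessDenied(f"{user} has no access to table {table}")
        if ALL_COLUMNS in allowed:
            return
        # Column-level check: every requested column must be granted.
        denied = sorted(set(columns) - allowed)
        if denied:
            raise AccessDenied(f"{user} cannot read columns {denied} of {table}")


policy = AccessPolicy()
policy.grant("alice", "sales.orders", ["order_id", "amount"])
policy.check("alice", "sales.orders", ["order_id"])  # passes silently
```

In a real deployment the table and column names would come from parsing the submitted SQL, and the grants from a metadata store; both are assumptions outside this sketch.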
