AI 每日进展速报 / Daily AI Digest - 2026-05-17

图像生成/编辑 / Image Generation/Editing

arXiv

GitHub

HuggingFace Models

视频生成/编辑 / Video Generation/Editing

arXiv

GitHub

HuggingFace Models

音频生成 / Audio Generation

arXiv

GitHub

HuggingFace Models

HuggingFace Spaces

语言大模型 / Large Language Models

arXiv

GitHub

HuggingFace Models

HuggingFace Datasets

Ended up with some tokens to burn on a Claude Max plan. Assembly began during 4.6 and moved to 4.7. Model is tagged. The develop...

HuggingFace Spaces

多模态大模型 / Multimodal Models

arXiv

GitHub

强化学习 / Reinforcement Learning

arXiv

GitHub

HuggingFace Datasets

Open-MM-RL is a multimodal STEM reasoning dataset covering Physics, Mathematics, Biology, and Chemistry. It is designed for...

中文说明

    Demo

If the video cannot be displayed in your environment, open it directly: assets/syndata-demo.mp4

    ...

The largest agentic-coding trace dataset to date: 112 B tokens of execution-free agentic trajectories covering 12...

One million executable, interpretable CAD construction sequences synthesized entirely without real-world data.

...

世界动作模型 / World Action Model

arXiv

GitHub


Generated automatically by Daily AI Digest Agent 生成时间: 2026-05-17 01:00:19