Today’s News Highlights
🚀 Domestic Rising Star: Magi-1 Autoregressive Video Model Open-Sourced, Leading in Physical Realism
💰 Capital Surge: Manus AI Secures $75 Million Led by Benchmark, Valuation Soars
🤖 Future of Agents: Illia Polosukhin Foresees AI Assistants Reshaping the Internet
📚 Code Encyclopedia: Cognition Launches DeepWiki, One-Click Interpretation of GitHub Projects
01 🚀 Domestic Rising Star: Magi-1 Autoregressive Video Model Open-Sourced, Leading in Physical Realism

Recently, the autoregressive video generation model Magi-1, open-sourced by the Beijing Sand AI team, has caused a sensation in the industry, garnering over 1.7k stars on GitHub in just two days. Magi-1 innovatively adopts a block-by-block predictive autoregressive architecture to generate videos, differing from the one-time generation approach of models like Sora. The model has demonstrated a “breakthrough” lead in physical realism benchmarks and is regarded as a significant milestone for open-source AI in China. It was developed by a team led by Cao Yue, a recipient of the Marr Prize and a Tsinghua Distinguished Doctoral Dissertation award.
Key Highlights
Researcher Insights For Practitioners: Magi-1 offers a robust video generation framework distinct from mainstream diffusion models, holding significant research value, particularly in physical simulation, causal reasoning, and long video generation. The autoregressive path has been validated to have notable advantages in specific tasks, potentially inspiring new ideas. However, attention should be paid to the hardware costs associated with large models and the fine-tuning challenges that new architectures may present. For General Users/Industry: Magi-1 exemplifies the rise of China’s open-source AI power, demonstrating the potential to catch up with and even surpass leading technologies in cutting-edge video generation. Its breakthrough in physical realism indicates an enhanced ability of AI to understand and simulate real-world laws, opening up imaginative possibilities for simulation, robotics training, and more. As a high-performance open-source model, it will accelerate global technological iteration, challenge the landscape of closed-source models, and enhance China’s influence in the global AI ecosystem. Its excellent performance in physical simulation may validate the advantages of the autoregressive method in simulating real physical laws.
Recommended Reading
- Magi-1 GitHub Repository
- Autoregressive Architecture: Employs an autoregressive denoising algorithm to predict and generate videos block by block (24 frames per block), supporting causal temporal modeling and streaming generation. The underlying structure is based on the Diffusion Transformer, incorporating multiple innovations to enhance efficiency and stability.
- Leading in Physics: In the Physics-IQ benchmark test, Magi-1 achieved a Physics IQ score of 56.02, far surpassing VideoPoet (29.50) and Sora (10.00), demonstrating exceptional physical realism.
- Open-Source Contribution: Fully open-sourced codebase, technical report, and pre-trained model weights in various parameter sizes (24B/4.5B) provide powerful tools for the research community.
- Controllable Generation: Supports block-level prompts for fine-grained control, scene transitions, long video synthesis, and offers multiple modes such as text-to-video (T2V), image-to-video (I2V), and video-to-video (V2V).
02 💰 Capital Surge: Manus AI Secures $75 Million Led by Benchmark, Valuation Soars

Chinese AI startup “Butterfly Effect” (developer of Manus AI) announced the completion of a $75 million funding round led by top Silicon Valley VC Benchmark, with a post-investment valuation soaring to nearly $500 million, an increase of approximately fivefold. Manus AI is positioned as a general-purpose AI Agent aimed at handling complex tasks such as resume screening, itinerary planning, and stock analysis. The new funding will be used to accelerate expansion in overseas markets such as the United States, Japan, and the Middle East.
Key Highlights
- Massive Funding: Secured $75 million in funding, led by Benchmark, with a post-investment valuation of nearly $500 million, indicating strong recognition from the capital market.
- General-Purpose Agent: Positioned as a general-purpose AI Agent capable of executing complex tasks, employing a multi-agent architecture with information retrieval, data analysis, tool integration, multi-modal processing, and adaptive learning capabilities.
- Global Expansion: Explicit plans to leverage funding to accelerate globalization, with a focus on expanding into the US, Japanese, and Middle Eastern markets, and plans to establish its first overseas office in Tokyo.
- Business Model: Has launched subscription services priced at $39 and $199 per month. Simultaneously, to address high operating costs and geopolitical considerations, a strategic partnership has been established with Alibaba Cloud’s Tongyi Qianwen.
Researcher Insights For Practitioners: Manus AI’s successful funding significantly boosts confidence in the AI Agent track, indicating the market’s optimism about the potential for automating complex knowledge work. However, its high operating costs (reliance on third-party large models) reveal economic challenges. The partnership with Alibaba Cloud provides a case study for observing how startups can leverage local resources, balance cost and performance, and mitigate risks. For General Users/Industry: The rise of Manus AI suggests that AI assistants capable of handling complex tasks are accelerating towards commercialization. Its high valuation and global expansion plans indicate its growing industry influence. Benchmark’s investment is particularly significant in the current environment, potentially reflecting its confidence in Manus’s application-layer technology and global potential. The gap between Manus AI’s high operating costs and its subscription fees raises concerns about the sustainability of its business model, highlighting a common challenge in current Agent development. Its valuation’s over 16-fold increase within a year reflects the sector’s fervor.
Recommended Reading
- Investing.com Report
- AIbase Report
03 🤖 Future of Agents: Illia Polosukhin Foresees AI Assistants Reshaping the Internet

Illia Polosukhin, co-author of the Transformer paper and co-founder of NEAR Protocol, shared his insights on the future of AI Agents in the a16z crypto podcast. He predicts that future Agents will be intelligent assistants that proactively propose solutions and may become the primary entry point for users to interact with the digital world, ultimately replacing traditional websites and apps. He also believes that Agents can act as information “garbage sorters,” combating the “dead internet,” and predicts that the number of Agents will far exceed the human population in the future.
Key Highlights
- Proactive Intelligence: Agents will evolve from passively executing instructions to proactively proposing solutions, with users only needing to make directional choices. Mature applications are expected within a year.
- End of Apps: Envisions an “Agentic Internet” future where users interact with services through personal Agents, potentially rendering traditional websites/apps unnecessary interfaces.
- Information Stewards: Agents can become users’ “internet garbage sorters,” actively verifying information, revealing truths, and combating online misinformation.
- Intelligent Network: In the future, everyone will have AI assistants, backed by a large number of sub-agents, forming a vast network of trillions of Agents providing an “on-demand assistant system.”
Researcher Insights For Practitioners: Polosukhin’s views depict a disruptive future interaction paradigm, where the development focus may shift from UI to robust service APIs. Agent planning, reasoning, tool usage, and trustworthiness will be core competencies. The concept of “user-owned” AI raises considerations about data sovereignty, privacy, and the role of decentralized infrastructure (such as NEAR). For General Users/Industry: This foreshadows a potentially simpler, more personalized, and trustworthy internet experience. It will have profound implications for existing digital advertising, app stores, search engines, and other business models. The interaction logic may shift from users actively “pulling” information to Agents proactively “pushing”/executing tasks. The success of Agents as information stewards hinges on their credibility and value alignment. The concept of “user-owned” and the transparency and user control of Agents become crucial.
Recommended Reading
- a16z Podcast Interview: AI, Apps & the Agentic Internet with NEAR’s Illia Polosukhin
04 📚 Code Encyclopedia: Cognition Launches DeepWiki, One-Click Interpretation of GitHub Projects

Cognition, the company that developed the AI software engineer Devin, has launched DeepWiki, a free tool that can transform any public GitHub code repository into an interactive Wiki page. Users only need to modify the URL (github.com -> deepwiki.com) to access an AI-generated project encyclopedia, aiming to make codebases easier to understand, especially for learners. The tool has gained popularity since its launch, indexing over 30,000 repositories and analyzing 4 billion lines of code.
Key Highlights
- One-Click Encyclopedia: Generates a Wiki for public repositories simply by modifying the URL, requiring no registration or login, making it extremely easy to use. Provides a search page for convenient browsing.
- Intelligent Interpretation: Utilizes AI to analyze code, README files, etc., automatically generating structured documentation, explaining code structure, key functions, dependencies, and generating interactive diagrams.
- In-Depth Research: Integrates a Devin-powered AI chat assistant that allows users to select Wiki text and ask questions, receiving context-aware answers based on the codebase, supporting advanced analysis queries.
- Open and Easy to Use: Free for all public GitHub repositories, with significant computational resources invested (over $300,000 in costs), demonstrating a commitment to promoting the accessibility of open-source code.
Researcher Insights For Developers (Practitioners): DeepWiki is an extremely valuable tool that can significantly save time and effort in understanding unfamiliar codebases, lowering the barrier to entry for reading code. The integrated chat function provides interactive learning and potential debugging assistance. The accuracy of AI-generated content still needs observation. For General Users (Beginners)/Industry: Greatly lowers the barrier to learning programming and participating in open-source contributions, promoting knowledge sharing. Demonstrates the application potential of AI in enhancing developer productivity, potentially putting pressure on platforms like GitHub. DeepWiki exemplifies AI’s shift from “replacement” to “enhancement” of developers, addressing the pain point of understanding existing code and is expected to promote the development of the open-source ecosystem. Cognition’s significant investment likely involves multiple strategic considerations (improving AI, traffic entry, brand building).
Recommended Reading
Today’s Summary
Today’s AI developments showcase several key trends:
- Continuous Breakthroughs in Video Generation Technology: Represented by the domestic open-source model Magi-1, new architectures (such as autoregressive) are driving significant progress in key indicators like physical realism in video generation, challenging existing technological paradigms.
- Sustained Enthusiasm in the AI Agent Sector: Manus AI’s substantial investment from a top-tier VC further demonstrates the market’s high expectations for AI Agents capable of automating complex tasks, although challenges remain regarding their business models and cost control.
- In-Depth Thinking on Future Interaction Paradigms: Industry thought leaders like Illia Polosukhin are envisioning a future internet dominated by AI Agents, potentially disrupting existing app and website models, sparking profound discussions on technological architecture, business models, and user trust.
- AI Empowering the Developer Ecosystem: Innovative tools like DeepWiki are leveraging AI to significantly improve developers’ workflows and experiences, lowering the barrier to understanding code and potentially injecting new vitality into the open-source community.
Overall, while the AI field demonstrates exciting technological potential and application prospects, it is also accompanied by ongoing exploration of costs, stability, credibility, and sustainable business models.