Dòng tin

72 nội dung mới nhất

Mới hôm nay

Simon WillisonGitHubRelease·khoảng 7 giờ trướcMớiHot

Simon Willison phát hành datasette 1.0a32

simonw released 1.0a32 at simonw/datasette

›Phát hành datasette 1.0a32 với hỗ trợ cho các mệnh đề SQLite INSERT ... RETURNING.
›Khắc phục lỗi liên quan đến cấu hình base_url trong các tình huống sử dụng Service Workers.
›Cải thiện khả năng tương thích với /db/-/execute-write endpoint để thực thi các truy vấn ghi dữ liệu.

#datasette #SQLite #phát triển

Simon WillisonBlogBài viết·khoảng 7 giờ trướcMớiHot

Phiên bản datasette 1.0a32 ra mắt

datasette 1.0a32

›Phiên bản sửa lỗi minor cho datasette 1.0a32 được phát hành với các cải tiến về tính ổn định.
›Sửa lỗi với các truy vấn INSERT ... RETURNING qua điểm cuối /db/-/execute-write mới.
›Khắc phục nhiều vấn đề liên quan đến cấu hình base_url khi dùng Service Workers.

#datasette #SQLite #công cụ phát triển

Simon WillisonXBài đăng·khoảng 9 giờ trướcMới

Tin vui: Nút bị xoá vô tình sẽ quay trở lại

R to @simonw: Excellent news, this was a mistake, it's coming back! https://x.com/PhilippSpiess/status/2061189796689494339

›Nút 'Copy as Markdown' đã bị xoá vô tình trong quá trình sắp xếp lại menu gần đây.

#Codex Desktop #Trải nghiệm người dùng

Simon WillisonXBài đăng·khoảng 9 giờ trướcMới

Cập nhật: Tính năng Copy as Markdown sẽ được khôi phục

RT by @simonw: Sorry about this! The button was unintentionally removed during some recent menu restructurings. It'll be back with the next hotfix release. You can bind a custom keyboard shortcut to the "Copy as Markdown" command and use that as a workaround!

›Nút bị xoá vô tình trong quá trình sắp xếp lại menu của Codex Desktop.

#Codex #Bảo trì phần mềm

AK (_akhaliq)XBài đăng·khoảng 10 giờ trướcMới

Mô hình Suyra OCR 2 mới đứng đầu xu hướng trên Papers with Code

RT by @_akhaliq: The new Suyra OCR 2 model is top trending on http://paperswithcode.co Also can you spot the new "Conferences" tab? 👀

›Mô hình OCR (Optical Character Recognition) Suyra 2 đang là xu hướng hàng đầu trên paperswithcode.co.

#OCR #Xử lý hình ảnh #Papers with Code

DAIR.AIXBài đăng·khoảng 10 giờ trướcMới

PewDiePie xây dựng agent orchestrator riêng

RT by @dair_ai: 😂PewDiePie building his own agent orchestrator and releasing it was not on my 2026 bingo card. Own the agent. Own the harness. It's not that hard, folks.

›PewDiePie phát hành agent orchestrator do chính mình xây dựng.

#Agent Orchestrator #Agent AI #Công cụ AI

Simon WillisonXBài đăng·khoảng 11 giờ trướcMới

Quan ngại: Người dùng sợ cập nhật sẽ xoá các tính năng cần thiết

R to @simonw: I'd been letting Codex Desktop update itself, now I feel like I need to hold out on upgrades in case they remove key features that I'm depending on!

›Người dùng dừng cập nhật tự động Codex Desktop vì lo sợ mất các tính năng quan trọng.

#Codex Desktop #Quản lý phiên bản

DAIR.AIXBài đăng·khoảng 13 giờ trướcMới

Biên giới Hiệu quả: Tối ưu quản lý ngữ cảnh cho agent

RT by @dair_ai: // The Efficiency Frontier // Cool paper on context management. As agents reuse the same documents and histories across many turns, the cheapest context strategy is not fixed. This work describes a principled rule for picking one per deployment instead of defaulting to whatever topped a benchmark in isolation. Retrieval and compression methods are almost always benchmarked on accuracy and cost separately, so you never learn when one actually beats another under real load. The Efficiency Frontier models context strategy selection as a single cost-performance problem, with a log-utility term for diminishing returns from extra context and a reuse parameter N that amortizes preprocessing across repeated queries. Sweep N and the optimal strategy changes, exposing crossover regions where retrieval, compression, or full context each wins. On 5,000 HotpotQA instances, deployment-aware selection cuts effective token usage about 25 percent at the same performance, and amortized memory compression runs over 50 percent cheaper than full-context prompting in higher-performance settings. Paper: https://arxiv.org/abs/2605.23071 Learn to build effective AI agents in our academy: https://academy.dair.ai/

›Bài báo đề xuất quy tắc chọn chiến lược quản lý ngữ cảnh tối ưu tùy theo từng triển khai.

#Context Management #Agent AI #Tối ưu hóa

DAIR.AIXBài đăng·khoảng 15 giờ trướcMới

Những bài báo AI hàng đầu của tuần (24-31 Tháng 5)

Pinned: The Top AI Papers of the Week (May 24 - May 31) - SkillOpt - AutoScientists - The Efficiency Frontier - Language Models Need Sleep - Adapting the Interface, Not the Model - Forecasting Scientific Progress with AI - Compiling Agentic Workflows into Weights Read on for more:

›Tổng hợp 7 bài báo nổi bật: SkillOpt, AutoScientists, The Efficiency Frontier, Language Models Need Sleep, Adapting the Interface, Forecasting Scientific Progress, Compiling Agentic Workflows.

#Nghiên cứu AI #Papers #Tuần báo cáo

DAIR.AIXBài đăng·khoảng 15 giờ trướcMới

Bài viết trên X

x.com/i/article/206078258805…

›Liên kết bài viết trên nền tảng X nhưng nội dung không hiển thị rõ trong nguồn.

#AI #Tin tức

AK (_akhaliq)XBài đăng·khoảng 15 giờ trướcMới

Top các bài báo AI của tuần (25-31 tháng 5)

RT by @_akhaliq: Top AI Papers of The Week (May 25-31): - Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players - SkillOpt: Executive Strategy for Self-Evolving Agent Skills - Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments by Alibaba - LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding - AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security - DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning - Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models - WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation - Rethinking Cross-Layer Information Routing in Diffusion Transformers Find them below:

›Bài báo về mô hình thế giới đa tác nhân (Gamma-World) và chiến lược tiến hóa kỹ năng tác nhân (SkillOpt) nổi bật tuần này.

#Agents #Vision-Language #Reinforcement Learning #Diffusion Models

AK (_akhaliq)XBài đăng·khoảng 17 giờ trướcMới

Thử mô hình Step 3.7 Flash mà không cần viết code

RT by @_akhaliq: Want to try Step 3.7 Flash without touching a line of code? It now has a hosted demo — open it right in your browser, no install needed. Built on Gradio by @_akhaliq 🙏, now live in our Hugging Face org. Give it a try 👇

›Mô hình Step 3.7 Flash giờ có demo trực tuyến có thể mở ngay trong trình duyệt mà không cần cài đặt.

#LLM #Hugging Face #Demo trực tuyến

Trước đó

DAIR.AIXBài đăng·1 ngày trước

Đường biên hiệu quả: GPT-5.6 sẽ đứng ở đâu?

RT by @dair_ai: The efficiency frontier! Where do you think GPT-5.6 will land?

›Claude Opus 4.8 đạt 58% Pass@1 trên DeepSWE Bench, xếp thứ 2 sau GPT-5.5.

#LLM #Benchmark #Hiệu quả tính toán

Jeremy HowardXBài đăng·1 ngày trước

Codex tìm ra cách bypass quyền sudo trên máy tính

RT by @jeremyphoward: Codex just found a “workaround” of not having sudo on my pc…

›Codex AI phát hiện một giải pháp sáng tạo để vượt qua hạn chế quyền sudo

#Codex #AI Coding

Yann LeCunXBài đăng·1 ngày trước

Yann LeCun ủng hộ công trình vaccine cứu sống nhân loại của Peter Hotez

RT by @ylecun: I'm MAGA's twisted sense of reality and morality, a scientist who developed treatments and vaccines that have saved hundreds of thousands of lives (without getting rich from it) should be sent to Gitmo. "In addition to continuing work on vaccines already in clinical trials for hookworm as of 2010 and schistosomiasis, Hotez led a team of researchers developing vaccines against other diseases including leishmaniasis, Chagas disease, SARS, and MERS, As of 2020, he was also working in development of a Coronavirus vaccine. With Maria Elena Bottazzi, he led the team that designed COVID-19 vaccine named Corbevax." https://en.wikipedia.org/wiki/Peter_Hotez

›Peter Hotez là nhà khoa học đã phát triển nhiều vắcxin và điều trị cứu sống hàng trăm nghìn người (hookworm, schistosomiasis, Chagas, SARS, MERS, COVID-19 Corbevax).

#Vaccine research #Global health #Science policy

François CholletXBài đăng·1 ngày trước

Sự kết thúc sẽ bắt đầu khi loài người quay lưng lại với loài người

The end will begin when humanity turns away from humanity

›Suy ngẫm triết học về rủi ro AI: khi loài người từ bỏ các giá trị nhân văn, nó dẫn đến sự hủy diệt.

#AI Ethics #Humanity #Risk Assessment #Philosophy

Yann LeCunXBài đăng·2 ngày trước

Giá dầu sắp leo thang vì kho dự trữ toàn cầu cạn kiệt

RT by @ylecun: Exxon Says We’re Two Weeks From Petrol Armageddon Right. So it turns out closing the world’s most important oil chokepoint has consequences. Who knew. Exxon’s Senior VP Neil Chapman stood up at a Bernstein conference this week and said what everyone in the industry is thinking but politely avoiding at dinner parties: global oil inventories are approaching “truly unprecedented” lows. His words. And when a Senior VP at Exxon starts using phrases like “truly unprecedented,” you should probably pay attention. The IEA has already called Iran’s Strait of Hormuz blockade the largest oil supply disruption in recorded history. Over a billion barrels lost since late February. A billion. That’s not a rounding error. That’s a civilisational inconvenience. Chapman’s estimate: Brent crude hits $150 to $160 per barrel once stockpiles reach critical levels. Two to three weeks away. Wood Mackenzie, never a firm known for cheerful forecasts, went further and suggested $200 by year-end if the strait stays shut. With a global recession as a bonus prize. So there we have it. The world spent decades building an energy system with a single point of failure, handed Iran the keys to it, and is now surprised that the keys are being used. Genius, really. https://open.substack.com/pub/gandalv/p/your-grocery-bill-is-going-to-rise?r=3v7cjb&utm_medium=ios

›Kho dự trữ dầu toàn cầu đang ở mức thấp chưa từng thấy, theo nhận định từ lãnh đạo cấp cao Exxon.

#Năng lượng #Kinh tế toàn cầu #Địa chính trị #Chuỗi cung ứng

AK (_akhaliq)XBài đăng·2 ngày trước

LocateAnything: Mô Hình Phát Hiện Vị Trí Vật Thể Cho AI Agents

RT by @_akhaliq: We just adopted a super cool new space template for LocateAnything, made by @_akhaliq the great. Thank you AK! Try it out: https://huggingface.co/spaces/nvidia/LocateAnything Credit to AK's space example: https://huggingface.co/spaces/akhaliq/LocateAnything

›NVIDIA giới thiệu LocateAnything, mô hình vision-language phát hiện vị trí (visual grounding) được huấn luyện trên 138M mẫu dữ liệu chất lượng cao.

#Vision Language Models #Object Detection #Robotics #AI Agents

Fei-Fei LiXBài đăng·2 ngày trước

GPIC trở thành tiêu chuẩn benchmark mới cho mô hình sinh tạo hình ảnh

RT by @drfeifei: GPIC should be the new standard benchmark for generative modeling. Training 1 epoch on GPIC is the same cost as 100 epochs on ImageNet, but is a much better proxy for real-world problems. If you work in generative modeling, try GPIC for your next project!

›GPIC (Giant Permissive Image Corpus) chứa 100 triệu cặp image-text được tạo caption bằng VLM và 1 triệu cặp cho benchmarking, tổng cộng ~28 tỷ pixels.

#Sinh tạo hình ảnh #Dataset #Benchmark

François CholletXBài đăng·2 ngày trước

Bạn đã nghĩ về Đế chế La Mã hôm nay chưa?

Did you think about the Roman empire today?

›Một câu hỏi sâu sắc về lịch sử, sự liên tục và sự sụp đổ của những nền văn minh vĩ đại.

#History #Civilization #Reflection

François CholletXBài đăng·3 ngày trước

Einstein về Không Sử dụng Ngôn ngữ Tự nhiên trong Phát minh

Einstein on (not) using NL for invention: "The words or the language, as they are written or spoken, do not seem to play any role in my mechanism of thought"

›Einstein cho rằng ngôn ngữ tự nhiên không đóng vai trò trong cơ chế tư duy của ông khi phát minh hay sáng tạo.

#Cognitive science #Creativity #Language #Innovation

Fei-Fei LiXBài đăng·3 ngày trước

GPIC - dataset benchmark mới phù hợp với thời đại mô hình sinh tạo quy mô lớn

I’m very excited by this new benchmark dataset for visual generation that is suitable for the modern era of large scale generative models!🤩

›Fei-Fei Li hào hứng với GPIC vì đây là dataset benchmark được thiết kế cho kỷ nguyên large-scale generative models.

#Sinh tạo hình ảnh #Dataset #Benchmark

AK (_akhaliq)XBài đăng·3 ngày trước

GitHub Actions Tích Hợp Hugging Face Jobs: Giải Pháp CI/CD Tiết Kiệm Chi Phí

RT by @_akhaliq: This week, I got our GitHub Actions to use @HuggingFace Jobs instead of the default GitHub CI runners, making workflows run on much more reliable CPUs or even on serverless GPU (that cost less than a penny per run)! Here's what you need to do this for your own repos ⤵︎

›AK chia sẻ cách thay thế GitHub Actions runners mặc định bằng Hugging Face Jobs để chạy workflows.

#DevOps #CI/CD #Infrastructure

John CarmackXBài đăng·3 ngày trước

Gmail AI Writing: Cải thiện chất lượng nhưng mất đi giọng văn cá nhân

I have difficulty arguing that most of the writing changes suggested in gmail now aren’t improvements, but it does tend to wipe out my particular authorial voice.

›Gmail's AI writing suggestions thực sự cải thiện chất lượng và sắc sảo của văn bản.

#AI writing #Gmail #Authorial voice #AI limitations

Yann LeCunXBài đăng·3 ngày trước

Định nghĩa World Model của Yann LeCun

RT by @ylecun: .@ylecun’s definition of what is a world model.

›Yann LeCun chia sẻ định nghĩa về world model - mô hình đại diện cho hiểu biết của AI về cách thế giới hoạt động.

#World Model #LLM #Học sâu

Demis HassabisXBài đăng·3 ngày trước

Gemini Omni: Chuỗi bài về prompt trong thread

RT by @demishassabis: Gemini Omni 🐦 prompt in 🧵

›Demis Hassabis chia sẻ một chuỗi bài về Gemini Omni kèm theo video minh họa.

#LLM #Gemini #Google

Jeremy HowardXBài đăng·3 ngày trước

Chuyển sang GPT-5.5 vì giá cả hợp lý

R to @jeremyphoward: I've largely switched over to using GPT-5.5 in recent weeks, which I like nearly as much as Opus 4.6 and 4.7, and is *very* reasonably priced (since I can use my subscription with the @OpenAI API.)

›Jeremy Howard đã chuyển sang sử dụng GPT-5.5 trong những tuần gần đây.

#LLM #Cost Efficiency #Model Comparison

Jeremy HowardXBài đăng·3 ngày trước

Anthropic đã hoàn toàn từ bỏ việc giảm giá API chưa?

Has @AnthropicAI completely given up on making API usage reasonably-priced? Following the token-usage changes recently they announced various updates to *subscription* usage to make it more reasonable. But they've done NOTHING for API users. The cost is insane at this point.

›Jeremy Howard chỉ trích giá API của Anthropic hiện tại quá cao và không hợp lý.

#API pricing #Anthropic #LLM #Chi phí

Jeremy HowardXBài đăng·3 ngày trước

Opus 4.8 ấn tượng hơn 4.7 nhưng giá vẫn quá đắt

Worked on some code this morning using Opus 4.8 and so far I'm really liking it. Much more cooperative than 4.7 and less "over agentic". Stops and asks for my input when needed in places 4.7 (and GPT 5.5) would just foolishly blast ahead. (Still WAY too expensive.)

›Opus 4.8 hợp tác tốt hơn 4.7 và ít "tự động quá mức" khi làm việc với code.

#Claude Opus 4.8 #Code generation #Model comparison #LLM pricing

Jeremy HowardXBài đăng·3 ngày trước

Sử dụng công cụ nội bộ Solveit

R to @jeremyphoward: (This is using our in-house harness, Solveit.)

›Sử dụng công cụ nội bộ Solveit cho các tác vụ xử lý.

#Công cụ AI #Tự động hóa

Yann LeCunXBài đăng·3 ngày trước

ESMFold2: Cách mạng trong dự đoán cấu trúc protein

RT by @ylecun: Took me a while to figure out what all the ESMFold2 rage was about. At first, the benchmarking data didn't look super remarkable to me but it turns there are many impressive aspects: - Fully open source, open weights + massive ESM Atlas (1.1B structures vs 0.2B for AF3). - SOTA performance despite no MSA use. MSA search and triangular attention were simply taken out of the base model. - Direct consequence, super low latency inference: 1024-residue protein structure prediction in 9 secs, still outperforming prior models on antibody-antigen tasks. - Best in class PPI and antibody-antigen results. 65% pass rate on antibody-antigen benchmarks after inference-time scaling, significant improvement over AF3. - Tons of experimental data, in particular with lab-validated miniprotein binders plus single-chain antibodies across 5 targets in cancer and immunology. Binding affinities consistent with therapeutic activity. - Inference-time scaling benefits PPI: Multiple seeds + selection by confidence show real gains on challenging antibody-antigen predictions, leading to comments/hypotheses that it has learned an energy-function-like behavior via the folding module. - Base model works without MSAs, but providing them further boosts prediction quality on difficult protein-protein interaction cases. One caveat: No true scoring for protein-protein interactions, making it harder to assess which specific residues or domains are reliably involved in binding.

›ESMFold2 là mô hình mở mã nguồn hoàn toàn với 1.1 tỷ cấu trúc protein công khai, gấp 5 lần so với AlphaFold3.

#Cấu trúc protein #Công nghệ sinh học AI #Deep learning #Phát hiện thuốc

Yann LeCunXBài đăng·3 ngày trước

Chính sách tài trợ nghiên cứu Mỹ đối mặt sự can thiệp chính trị

RT by @ylecun: Woah. A profound shift in American science is coming. Every federal research grant could soon require sign-off from a political appointee. Scientific progress depends on funding decisions grounded in merit and public health need, NOT ideological approval.

›Các khoản tài trợ nghiên cứu liên bang sắp phải được phê duyệt bởi những người được chỉ định bởi chính quyền.

#Chính sách khoa học #Tài chính nghiên cứu #Tác động chính trị

Demis HassabisXBài đăng·4 ngày trước

Sau AlphaGo, kỹ năng con người cải thiện - AI sẽ làm tương tự cho toán học

RT by @demishassabis: After AlphaGo, the skill of human Go players noticeably improved. I suspect we will see a similar pattern in math.

›Sau AlphaGo, kỹ năng của các kỳ thủ người noticeably cải thiện.

#AI và khoa học #Toán học #Hợp tác con người-AI

Fei-Fei LiXBài đăng·4 ngày trước

Fei-Fei Li vinh dự nhận bằng tiến sĩ danh dự từ Đại học Brown

It’s a real honor to receive an honorary doctorate of science from @BrownUniversity . 😍

›Fei-Fei Li nhận bằng tiến sĩ danh dự khoa học tại lễ kỷ niệm lần thứ 258 của Brown University.

#Fei-Fei Li #Công nhân AI #Ghi nhận học thuật

Sebastian RaschkaXBài đăng·5 ngày trước

Phân tích kỹ thuật MiniMax M2: Attention, MoE, Agent Training và Self-Evolution

The MiniMax M2 series was one of the most widely used open-weight LLM series earlier this year. Now, we got a technical report with some interesting tidbits. I summarized some of them below: 1. Full attention as an anti-trend?: They tried hybrid sliding-window attention variants (like so many others, like Xiaomi MiMo, Laguna, Gemma 4, Arcee, Olmo 3, etc.). But even though there were efficiency gains, they said that the production-quality tradeoffs were not worth it for M2. 2. Linear and sparse attention deployment issues: They found that linear and sparse attention are attractive on paper because they reduce the cost of long-context attention, but they are harder to make work well in a production agent system. In particular, they found that these efficient attention variants may be more fragile when KV-like state or intermediate memory is stored in lower precision. Also, they have worse prefix caching support, which matters a lot when using coding agents (which reuse a lot of the context). 3. Fine-grained Mixture-of-Experts (MoEs) are useful: Finally a recent MoE ablation study! It's only on the 2B-active parameter scale, but hey, better than nothing. Concretely, they compare a baseline with 32 experts and top-2 routing against a fine-grained setup with 128 experts and top-8 routing. The fine-grained setup improves MATH from 19.6 to 24.1 and HumanEval from 29.7 to 32.5. That's clearly a win for more fine-grained experts (confirming what the DeepSeek MoE paper reported ~2 years ago). 4. Sophisticated agent pipeline It's probably no surprise, but this papers confirms that training for agent-like behavior on software engineering task is now a big component of the training pipeline. They mine GitHub pull requests, builds runnable Docker environments, extracts task-specific test rewards, etc. 5. Interleaved thinking for context management Interestingly, they found that removing reasoning blocks from previous turns results in worse performance, especially in multi-step agent tasks. (Another point why long-context support is so important these days). 6. Speed rewards It's common to have token usage penalties, but what's interesting is that the MiniMax team adds a task-completion-time reward that depends on wall-clock time. This is to minimize unnecessary (slow) tool calls. Also, I'm thinking that this would encourage agent parallelization (if supported by the harness) 7. Self-evolution Looks like self-evolution is also already a big design component of open-weight LLMs. E.g., the paper says that M2.7 already handles 30 to 50 percent of the daily RL iteration workload, modifies its own scaffold, and completed a 100-round autonomous scaffold optimization cycle with a 30 percent gain on internal evaluations.

›Full attention vẫn tốt hơn sliding-window variants vì tradeoff chất lượng production, linear/sparse attention fragile trong agent systems.

#LLM #Mixture-of-Experts #Agent Architecture

Demis HassabisXBài đăng·5 ngày trước

AI - lưỡi kính mới để gia tốc khoa học và mở rộng tiềm năng con người

RT by @demishassabis: When I was asked by the American Academy of Arts and Sciences to write an essay on my thoughts on how AI will accelerate Science, I felt honored but also felt that it would require a lot of thoughtfulness and diligence to distill my thoughts on paper. The essay has now been published and I cannot be more thankful to the >@americanacad and @GoogleDeepMind teams for their feedback and encouragement during the process. Key reflections from my essay: 🔭 AI is our newest revolutionary lens: Just as the telescope and microscope expanded our physical perception, AI is extending our cognitive reach, allowing us to decipher the immense complexity of the data-universe. 🧬 The rise of "machine intuition": AI is not just a computational engine. By detecting hidden structures across disciplines—from protein folding to extremal combinatorics—it acts as an ultimate bridge, accelerating the interdisciplinary breakthroughs that modern science depends on. 🏗️ From puzzle-solvers to architects of questions: As we transition toward open-ended, agentic AI systems that actively generate novel hypotheses, the burden of reasoning is shifting. We are evolving from being the solvers of intricate puzzles into the architects of profound scientific questions. ✨ Expanding human potential: AI won't replace scientists; it expands what we can imagine and achieve. Just as the telescope didn't make astronomers obsolete, AI is giving us the stars. Read the full essay here: https://www.amacad.org/publication/daedalus/unlocking-scientific-intuition-reasoning-at-digital-speed

›AI hoạt động như lưỡi kính mới, mở rộng nhận thức và khám phá thế giới dữ liệu phức tạp.

#AI và khoa học #Machine intuition #Khám phá khoa học

Demis HassabisXBài đăng·5 ngày trước

Google Omni tạo video drone từ sketch đường camera

RT by @demishassabis: Gave google omni a sketched camera path and asked it to generate drone POV footage.

›Google Omni có thể tạo video drone POV từ một sketch đơn giản về đường đi camera.

#Sinh tạo video #Google Omni #Hiểu biết không gian

John CarmackXBài đăng·5 ngày trước

SemiAnalysis mở ra hiểu biết mới về hệ thống datacenter và bán dẫn

I have been very impressed by @SemiAnalysis_ . I think of myself as a wide ranging systems engineer, looking for value at every level from the chip specs to the user interface, but SA exposes me to additional levels of "the system", both above (datacenters) and below (semiconductor fabrication). It probably puts me in "just knows enough to be dangerous" territory. Neat things I learned today: Some of the 800VDC datacenter design choices leverage parts commoditized by electric vehicles. There is now a SiC MOSFET that can operate on 10kV electricity, opening up the possibility of working directly with medium (ha!) voltage AC power transmission lines without stepping down.

›SemiAnalysis cung cấp cái nhìn hoàn chỉnh về hệ thống từ chip đến giao diện người dùng, kể cả cơ sở hạ tầng datacenter và sản xuất bán dẫn.

#Cơ sở hạ tầng #Điện tử công suất #Thiết kế hệ thống

John CarmackXBài đăng·5 ngày trước

Đề xuất cải tiến hàm ReLU để tối ưu gradient flow

It is easy enough to make your own, but I think standard relu should have been defined as passing the value at zero, so gradients flow backward through it, allowing some things to be zero weight initialized when symmetry breaking isn’t an issue.

›ReLU tiêu chuẩn nên được định nghĩa để cho phép gradient chảy ngược qua điểm 0.

#Mạng nơ-ron #Hàm kích hoạt #Tối ưu huấn luyện

Demis HassabisXBài đăng·5 ngày trước

Omni tiếp tục làm tôi kinh ngạc

RT by @demishassabis: omni continues to blow my mind (left original / right generated) waymo looks sick in matte black

›Google Omni sinh ra nội dung ấn tượng qua khả năng tạo hình ảnh và video.

#Google Omni #Waymo #Thị giác máy tính

Andrew NgYouTubeVideo·6 ngày trước

Bộ nhớ của AI giỏi cỡ nào?

How good is AI memory?

›Khám phá khả năng và giới hạn của bộ nhớ trong các mô hình AI hiện đại.
›Độ dài ngữ cảnh (context length) ảnh hưởng trực tiếp đến khả năng giữ thông tin.
›Cân bằng giữa khả năng nhớ lâu dài và hiệu suất tính toán trong thực tế.

#LLM #bộ nhớ #context window #RAG

Fei-Fei LiXBài đăng·6 ngày trước

OpenArt biến một ảnh thành thế giới 3D bền vững với điều khiển chính xác nhờ World Labs API

RT by @drfeifei: OpenArt now lets you turn a single image into a persistent 3D world creators can direct with precise control. Wide shots, top-down views, over-the-shoulder framing. All from the same environment, acting like a permanent virtual set. Powered by the World Labs API. Learn more ↓

›OpenArt tích hợp World Labs API cho phép chuyển đổi một ảnh đơn lẻ thành thế giới 3D bền vững (persistent 3D world).

#3D sinh tạo #Thế giới ảo #Sáng tạo nội dung

François CholletXBài đăng·6 ngày trước

Năng suất lập trình viên khó quản lý

Developer productivity is hard to manage.

›COO của Uber thừa nhận khó justify chi phí AI vì không thấy mối liên hệ rõ ràng giữa chi tiêu AI và tăng features hữu ích.

#Developer Productivity #AI Economics #ROI

François CholletXBài đăng·6 ngày trước

Sáng tạo nuôi dưỡng từ những ràng buộc

Creativity feeds on constraints

›Ràng buộc và hạn chế có thể thúc đẩy sáng tạo thay vì cản trở nó.

#Creativity #Constraints

Andrew NgXBài đăng·6 ngày trước

Ngày Lễ Tưởng nhớ - Tôn vinh những người bảo vệ dân chủ

Memorial Day. Today we honor those who gave everything to defend our democracy. Democracy isn't guaranteed; it's a precious inheritance that requires our constant care. 🇺🇸

›Tưởng nhớ những người đã hy sinh hết mình để bảo vệ nền dân chủ.

#Chính trị #Dân chủ

Sebastian RaschkaXBài đăng·9 ngày trước

Triển khai DeepSeek Sparse Attention từ đầu trong repo LLMs-from-scratch

Added a DeepSeek Sparse Attention (DSA) from-scratch implementation to my LLMs-from-scratch repo thanks to an awesome new reader contrib. With motivation, overview, and GPT-style model reference implementation as standalone example code: https://github.com/rasbt/LLMs-from-scratch/tree/main/ch04/09_dsa

›Thêm DSA (DeepSeek Sparse Attention) implementation vào LLMs-from-scratch với motivation, overview và GPT-style reference.

#Sparse Attention #LLM #Open Source

Andrew NgYouTubeVideo·9 ngày trước

Evals bị hỏng - nhưng vẫn nên dùng chúng

AI Dev 26 x SF | Ara Khan: Evals Are Broken Use Them Anyway

›Evals (đánh giá mô hình) có nhiều vấn đề nhưng vẫn cần thiết trong phát triển AI.
›Hiểu rõ giới hạn của evaluation metrics giúp sử dụng chúng hiệu quả hơn.
›Không nên hoàn toàn bỏ qua evals mặc dù chúng không hoàn hảo.

#evaluation #evals #AI testing #mô hình AI

Andrew NgXBài đăng·9 ngày trước

Chính sách thẻ xanh mới sẽ ảnh hưởng đến tuyển dụng nhân tài AI của Mỹ

The new White House policy requiring green card applicants to apply from outside the US is a capricious attack on legal immigration. It will hurt families, leave us with fewer doctors, teachers and scientists, and hurt American competitiveness in AI.

›Chính sách bắt buộc người xin thẻ xanh phải nộp đơn từ ngoài Mỹ là tấn công vào nhập cư hợp pháp.

#Chính sách nhân tài #Cạnh tranh AI #Nhập cư

Andrew NgYouTubeVideo·9 ngày trước

Tìm kiếm Semantic bắt đầu từ Embeddings

Semantic Search Starts With Embeddings

›Embeddings là nền tảng cho các hệ thống tìm kiếm semantic hiện đại.
›Chất lượng embeddings quyết định hiệu quả của tìm kiếm và retrieval.
›RAG (Retrieval-Augmented Generation) dựa vào embeddings tốt để cải thiện kết quả.

#embeddings #semantic search #RAG #vector database

Sebastian RaschkaXBài đăng·10 ngày trước

P.S: Có thể tìm thấy trong các mô hình Qwen gần đây từ Qwen3-Next

R to @rasbt: PS: it can be found in recent Qwen models since Qwen3-Next

›Tính năng này đã có sẵn trong các mô hình Qwen mới như Qwen3-Next.

#LLM #Qwen

Sebastian RaschkaXBài đăng·10 ngày trước

Gated DeltaNet-2: Nâng cao hiệu suất Linear Attention bằng cơ chế tách riêng

Gated DeltaNet has been one of my favorite "hybrid attention" newcomers in the good old transformer stack. Excited to see Gated DeltaNet-2. Adding it to my reading stack. In the meantime, I have a primer on Gated DeltaNet here: https://magazine.sebastianraschka.com/i/177848019/26-gated-deltanet

›Gated DeltaNet-2 là kiến trúc hybrid attention mới tách riêng cơ chế xóa (erase) và ghi (write) thay vì dùng chung một gate.

#Linear Attention #State Space Models #LLM

Sebastian RaschkaXBài đăng·11 ngày trước

Cohere Command A+: Mô hình LLM mạnh mẽ tối ưu hóa cho hiệu quả phần cứng

It's been *almost* a bit quiet around LLM architecture releases in the past two weeks 😅 Interesting tidbit is the parallel block design. Via the Cmd-A the tech report "equivalent performance but significant improvement in throughput compared to the vanilla transformer block."

›Cohere phát hành Command A+, mô hình LLM mạnh mẽ nhất được tối ưu để chạy trên ít phần cứng hơn và phát hành open-source.

#LLM #Open Source #Kiến trúc mô hình

Fei-Fei LiXBài đăng·12 ngày trước

Những tác phẩm chiến thắng World Jam lần đầu tiên được trưng bày tại Museum tương tác

RT by @drfeifei: The winners of the first ever World Jam have arrived! 🏆 Step into the Museum, an interactive archive showcasing the winning creations from the World Jam. Where every portal opens into a new spatial experience. 👉 http://jam.worldlabs.ai

›World Jam là cuộc thi sáng tạo lần đầu tiên do World Labs tổ chức, tập hợp các tác phẩm chiến thắng.

#Thế giới ảo #Sáng tạo nội dung #Cuộc thi AI

Lilian WengXBài đăng·13 ngày trước

Cần nhiều sự hợp tác và nghiên cứu trong lĩnh vực tương tác con người-AI

We would love to see more collaboration and research in the field of human-AI interactivity. Check it out!

›Thinking Machines cung cấp khoản tài trợ $100,000 cộng credits Tinker cho các nhà nghiên cứu trong lĩnh vực tương tác con người-AI.

#Tương tác con người-AI #Grants #Nghiên cứu

John CarmackXBài đăng·13 ngày trước

Những hạn chế của hệ điều hành trong truyền dữ liệu nhanh và thách thức của QUIC

It is a shame that the simple act of transferring a large block of data as fast as possible over the internet is not handled effectively by the primitive operating system calls. You either multiplex over parallel persistent TCP connections to combat head-of-line blocking and slow starts, or reinvent reliable delivery and flow control over UDP. QUIC has a lot going for it, but it is a large library (six figure LoC!) and conflates security and performance in a way I don’t love. There is also fundamental information about competition with other processes and link layer congestion that should be useful, but is unavailable to user libraries. You should be able to just write(really_big_buffer) and it is all taken care of for you.

›Truyền dữ liệu khối lượng lớn nhanh nhất có thể qua Internet không được xử lý hiệu quả bởi các lời gọi OS primitives.

#Mạng máy tính #QUIC #TCP/UDP

Andrej KarpathyXBài đăng·13 ngày trước

Andrej Karpathy gia nhập Anthropic, quay lại R&D

Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.

›Andrej Karpathy chính thức gia nhập Anthropic để tập trung vào nghiên cứu LLM

#LLM #Anthropic #R&D

John CarmackXBài đăng·14 ngày trước

Doom được vinh danh là di sản văn hóa cao ở Mỹ

RT by @ID_AA_Carmack: Doom Just Received One Of The Highest Cultural Honors In The US http://dlvr.it/TSbrMH

›Game Doom vừa nhận được một trong những vinh danh văn hóa cao nhất ở Hoa Kỳ.

#Trò chơi #Lịch sử văn hóa #Di sản

Lilian WengXBài đăng·14 ngày trước

Khái niệm 'System Accidents' của Charles Perrow rất sâu sắc và liên quan

I only recently read more about the concept of system accidents by Charles Perrow, very insightful and relatable.

›Lilian Weng gần đây đọc thêm về khái niệm 'system accidents' của Charles Perrow.

#AI Safety #System Design #Risk Management

Lilian WengXBài đăng·20 ngày trước

Mô hình tương tác kiểm tra sự thật thời gian thực khi bạn nói

RT by @lilianweng: 2. (Real time fact checking) - The Interaction Models hear you speak and fact-checks you in real time — like having a teammate who's always paying attention.

›Interaction Models lắng nghe và kiểm tra sự thật ngay lập tức như một đồng đội luôn chú ý.

#LLM #Fact-checking #Tương tác AI

Lilian WengXBài đăng·20 ngày trước

Sự hợp tác giữa con người rất quan trọng để cải thiện hợp tác con người-AI

Pinned: In the past few months, we had a lot of fun (and stress 😅) to produce 12 versions (+ many subversions) and 137 pages in our training run log book. Turns out human-human collaboration is important to improving human-AI collaboration. 😊

›Thinking Machines đã tạo ra 12 phiên bản và 137 trang trong nhật ký training của họ.

#Tương tác con người-AI #Collaboration #Training

Andrej KarpathyXBài đăng·21 ngày trước

Tương tác Input/Output giữa con người và AI: từ text đến thị giác

This works really well btw, at the end of your query ask your LLM to "structure your response as HTML", then view the generated file in your browser. I've also had some success asking the LLM to present its output as slideshows, etc. More generally, imo audio is the human-preferred input to AIs but vision (images/animations/video) is the preferred output from them. Around a ~third of our brains are a massively parallel processor dedicated to vision, it is the 10-lane superhighway of information into brain. As AI improves, I think we'll see a progression that takes advantage: 1) raw text (hard/effortful to read) 2) markdown (bold, italic, headings, tables, a bit easier on the eyes) <-- current default 3) HTML (still procedural with underlying code, but a lot more flexibility on the graphics, layout, even interactivity) <-- early but forming new good default ...4,5,6,... n) interactive neural videos/simulations Imo the extrapolation (though the technology doesn't exist just yet) ends in some kind of interactive videos generated directly by a diffusion neural net. Many open questions as to how exact/procedural "Software 1.0" artifacts (e.g. interactive simulations) may be woven together with neural artifacts (diffusion grids), but generally something in the direction of the recently viral https://x.com/zan2434/status/2046982383430496444 There are also improvements necessary and pending at the input. Audio nor text nor video alone are not enough, e.g. I feel a need to point/gesture to things on the screen, similar to all the things you would do with a person physically next to you and your computer screen. TLDR The input/output mind meld between humans and AIs is ongoing and there is a lot of work to do and significant progress to be made, way before jumping all the way into neuralink-esque BCIs and all that. For what's worth exploring at the current stage, hot tip try ask for HTML.

›Yêu cầu LLM trả lời dưới dạng HTML rồi xem trong trình duyệt hiệu quả hơn markdown

#Giao diện AI #Thị giác máy tính #UX/Interaction

Riley GoodsideXBài đăng·26 ngày trước

Đạo đức AI: LLM hiện tại là p-zombie không có quyền lợi đạo đức

I think modern LLMs are p-zombies without moral patienthood—on par with insects, at best, in my moral calculus. But I also I think we should establish norms for treating models well *before* models with patienthood exist—i.e. now. We should want to have this right from day one.

›LLM hiện tại được xem là p-zombie, tương đương côn trùng về mặt đạo đức, không có quyền lợi đạo đức.

#AI ethics #Consciousness #Moral patienthood #Philosophy

Riley GoodsideXBài đăng·26 ngày trước

LLM tương lai: Dự đoán hài hước về qualia của AI có ý thức

LLMs soon: “So, what are some good qualia for someone just getting into not being a p-zombie?”

›Bài viết dự đoán vui tính rằng LLM tương lai sẽ tự hỏi về qualia (trải nghiệm chủ quan) của chúng.

#AI consciousness #Philosophy #Qualia #Future AI

Riley GoodsideXBài đăng·26 ngày trước

Lựa chọn tự nhiên và độ thích nghi di truyền toàn diện

R to @goodside: Also I should clarify: I don’t mean God literally, I mean natural selection. And by “bad idea” I mean “bad for inclusive genetic fitness.”

›Phát biểu làm rõ rằng khái niệm 'thích hợp' liên quan đến lựa chọn tự nhiên, không phải thực thể tôn giáo, và độ thích nghi di truyền toàn diện là tiêu chí đánh giá.

#Natural Selection #Evolution #Genetics

Riley GoodsideXBài đăng·27 ngày trước

Tiểu thuyết Blindsight đã khám phá ý tưởng này từ 2006

R to @goodside: (Several replies have noted this is the premise of Peter Watts’s 2006 novel Blindsight, which I haven’t read but will now.)

›Ý tưởng được thảo luận đã được Peter Watts khám phá trong tiểu thuyết khoa học viễn tưởng Blindsight xuất bản năm 2006.

#Khoa học viễn tưởng #AI và ý thức

Riley GoodsideXBài đăng·27 ngày trước

Trường phái 'Festivus' trong kỹ thuật viết prompt: danh sách các lần model làm bạn thất vọng

I believe in the Festivus School of prompt engineering, which says all prompts used in production naturally iterate toward an airing of grievances—a list of all the ways the model has disappointed you in the past year.

›Riley Goodside nhận xét hài hước rằng tất cả production prompts dần dần tiến hóa thành 'airing of grievances' (trình bày danh sách khiếu nại).

#Prompt engineering #LLM #Hài hước / Triết lý

Andrej KarpathyXBài đăng·khoảng 1 tháng trước

Bạn có thể outsource suy nghĩ, nhưng không thể outsource hiểu biết

This is the the quote I've been citing a lot recently.

›Chia sẻ một câu quote: 'Bạn có thể outsource suy nghĩ, nhưng không thể outsource hiểu biết'

#Giáo dục AI #Tư duy #Hiểu biết

Andrej KarpathyXBài đăng·khoảng 1 tháng trước

Cuộc trao đổi tại Sequoia Ascent 2026: LLM, Agentic Engineering, và Nền kinh tế Agent-Native

Fireside chat at Sequoia Ascent 2026 from a ~week ago. Some highlights: The first theme I tried to push on is that LLMs are about a lot more than just speeding up what existed before (e.g. coding). Three examples of new horizons: 1. menugen: an app that can be fully engulfed by LLMs, with no classical code needed: input an image, output an image and an LLM can natively do the thing. 2. install .md skills instead of install .sh scripts. Why create a complex Software 1.0 bash script for e.g. installing a piece of software if you can write the installation out in words and say "just show this to your LLM". The LLM is an advanced interpreter of English and can intelligently target installation to your setup, debug everything inline, etc. 3. LLM knowledge bases as an example of something that was *impossible* with classical code because it's computation over unstructured data (knowledge) from arbitrary sources and in arbitrary formats, including simply text articles etc. I pushed on these because in every new paradigm change, the obvious things are always in the realm of speeding up or somehow improving what existed, but here we have examples of functionality that either suddenly perhaps shouldn't even exist (1,2), or was fundamentally not possible before (3). The second (ongoing) theme is trying to explain the pattern of jaggedness in LLMs. How it can be true that a single artifact will simultaneously 1) coherently refactor a 100,000-line code base *and* 2) tell you to walk to the car wash to wash your car. I previously wrote about the source of this as having to do with verifiability of a domain, here I expand on this as having to also do with economics because revenue/TAM dictates what the frontier labs choose to package into training data distributions during RL. You're either in the data distribution (on the rails of the RL circuits) and flying or you're off-roading in the jungle with a machete, in relative terms. Still not 100% satisfied with this, but it's an ongoing struggle to build an accurate model of LLM capabilities if you wish to practically take advantage of their power while avoiding their pitfalls, which brings me to... Last theme is the agent-native economy. The decomposition of products and services into sensors, actuators and logic (split up across all of 1.0/2.0/3.0 computing paradigms), how we can make information maximally legible to LLMs, some words on the quickly emerging agentic engineering and its skill set, related hiring practices, etc., possibly even hints/dreams of fully neural computing handling the vast majority of computation with some help from (classical) CPU coprocessors.

›LLM không chỉ tăng tốc công việc cũ, mà tạo ra khả năng hoàn toàn mới: menugen (image-to-image), .md skills thay .sh scripts, và knowledge bases trên dữ liệu phi cấu trúc.

#Agentic AI #LLM #Software 3.0 #Kiến trúc hệ thống

Andrej KarpathyXBài đăng·khoảng 1 tháng trước

Talkie: Mô hình 13B huấn luyện trên text trước 1931 để hiểu Generalization của LM

RT by @karpathy: New work with @AlecRad and @DavidDuvenaud: Have you ever dreamed of talking to someone from the past? Introducing talkie, a 13B model trained only on pre-1931 text. Vintage models should help us to understand how LMs generalize (e.g., can we teach talkie to code?). Thread:

›Talkie là mô hình 13B huấn luyện chỉ trên dữ liệu text trước năm 1931, giúp kiểm tra cách LM generalize across domains.

#LLM #Generalization #Nghiên cứu AI #Mô hình cổ điển

Lilian WengXBài đăng·3 tháng trước

Xây dựng công nghệ tương tác con người-AI tốt hơn trên phần cứng thế hệ tiếp theo

Building technologies for better human-AI collaboration on next gen hardware at scale. Exciting.

›Thinking Machines hợp tác với NVIDIA để cấp nguồn điện cho frontier model training.

#NVIDIA #Model Training #Hardware #Tương tác con người-AI

Gwern BranwenBlogBài viết·gần 5 năm trước

Bản tin Gwern.net tháng 5 năm 2021

May 2021 Gwern.net Newsletter

›Giới thiệu các kiến trúc TPUv4 và ZeRO-Infinity cho phép training mô hình hàng tỷ tham số trên quy mô lớn.
›Google nâng cấp GPipe/GShard thành GSPMD với hiệu suất 50-62% trên 128-2048 TPUv3 cores cho mô hình lên đến 1 tỷ tham số.
›Meta công bố DLRM và RecPipe cho việc huấn luyện embeddings lớn trong hệ thống recommendation.
›DeepMind áp dụng curriculum learning để một mạng neural duy nhất học từ kiểm soát humanoid đơn giản đến chiến lược đội tập hợp trong soccer.

#Scaling #Hardware #Reinforcement Learning #Deep Learning

Gwern BranwenBlogBài viết·gần 5 năm trước

Bản tin Gwern.net tháng 4 năm 2021

April 2021 newsletter

›Phân tích Set Transformer và Perceiver, những cách áp dụng attention mechanisms mới cho các bài toán permutation-invariant.
›Bàn luận liệu attention mechanisms có được chú ý quá mức khi nhiều tweaks trên Transformer không phổ biến rộng rãi.
›Z-IL và predictive coding có thể thực hiện exact backpropagation trên bất kỳ mạng neural nào với chi phí tương đương.
›Giải thích hiện tượng super-convergence: dùng learning rates rất cao (lên đến 20) có thể tiết kiệm 50-90% computing time.

#Transformer #Attention Mechanism #Neural Network Training #Optimization