Dòng tin
Tất cả
Bộ nhớ của AI giỏi cỡ nào?
How good is AI memory?
- ›Khám phá khả năng và giới hạn của bộ nhớ trong các mô hình AI hiện đại.
- ›Độ dài ngữ cảnh (context length) ảnh hưởng trực tiếp đến khả năng giữ thông tin.
- ›Cân bằng giữa khả năng nhớ lâu dài và hiệu suất tính toán trong thực tế.
Ngày Lễ Tưởng nhớ - Tôn vinh những người bảo vệ dân chủ
Memorial Day. Today we honor those who gave everything to defend our democracy. Democracy isn't guaranteed; it's a precious inheritance that requires our constant care. 🇺🇸
- ›Tưởng nhớ những người đã hy sinh hết mình để bảo vệ nền dân chủ.
Evals bị hỏng - nhưng vẫn nên dùng chúng
AI Dev 26 x SF | Ara Khan: Evals Are Broken Use Them Anyway
- ›Evals (đánh giá mô hình) có nhiều vấn đề nhưng vẫn cần thiết trong phát triển AI.
- ›Hiểu rõ giới hạn của evaluation metrics giúp sử dụng chúng hiệu quả hơn.
- ›Không nên hoàn toàn bỏ qua evals mặc dù chúng không hoàn hảo.
Chính sách thẻ xanh mới sẽ ảnh hưởng đến tuyển dụng nhân tài AI của Mỹ
The new White House policy requiring green card applicants to apply from outside the US is a capricious attack on legal immigration. It will hurt families, leave us with fewer doctors, teachers and scientists, and hurt American competitiveness in AI.
- ›Chính sách bắt buộc người xin thẻ xanh phải nộp đơn từ ngoài Mỹ là tấn công vào nhập cư hợp pháp.
Tìm kiếm Semantic bắt đầu từ Embeddings
Semantic Search Starts With Embeddings
- ›Embeddings là nền tảng cho các hệ thống tìm kiếm semantic hiện đại.
- ›Chất lượng embeddings quyết định hiệu quả của tìm kiếm và retrieval.
- ›RAG (Retrieval-Augmented Generation) dựa vào embeddings tốt để cải thiện kết quả.
Tại sao mỗi Agent AI cần một Simulation Sandbox
AI Dev 26 x SF | Andi Partovi: Why Every Agent Needs a Simulation Sandbox
- ›Sandbox mô phỏng cho phép agent kiểm tra hành động mà không ảnh hưởng thế giới thực.
- ›Mô phỏng giúp agent học và tối ưu hóa hành động trước khi triển khai.
- ›Bảo mật và an toàn của AI agent được nâng cao thông qua môi trường sandbox.
Giới hạn điểm A ở Harvard mâu thuẫn với triết lý giáo dục hỗ trợ toàn bộ học sinh
Harvard University just voted to limit the number of A grades given in undergraduate classes to about 20% of the class. I’m not in favor of this. It deeply runs counter to how I believe education should be. We should hold a high bar, but also work mightily to support the success of 100% of learners, rather than a fraction.
Harvard’s administration took this step — over the objections of a large fraction of the student body — to counter grade inflation. Grade inflation is real: Many universities have been awarding A and B grades to ever larger fractions of students, and this has caused grade point averages (GPAs) to become less useful as signals of student skill. At the same time, we want students to succeed. The heart of the question is the role of educational institutions. Should our goal be:
- To help students succeed?
- To judge students?
Both of these have value. But my focus when working in education is almost entirely helping students succeed.
To me, it is clear that many people want to learn, to be empowered, to build skills that let them do new things! This is what we focus on at DeepLearningAI. This philosophy is also why my online courses (going back to my early online Stanford courses on Coursera) permitted an unlimited number of retries for graded assignments.
I believe in letting — and even encouraging — someone to redo something until they succeed. This is as opposed to standing in judgement of the fact they didn’t get it right the first time. Further, I want homework assignments to be designed primarily to help people practice and learn, rather than to judge their skill level. This is why I prefer to create “Practice Problems” and “Practice Labs” — questions that, when you think through them, help you to gain practice and reinforce what you know. As opposed to “Assessment Problems” designed primarily to judge skill.
But won’t Harvard’s move make GPAs more meaningful and help prospective employers identify strong candidates? Having hired a large number of people from Harvard and other institutions, I can say confidently that GPA is not an important signal. We have screening and interviewing processes that give far more accurate ways to figure out if someone is truly skilled. I do not need a wider spread in applicant GPA scores to figure out who's really good!
To be clear, there is also value in assessment. Even though standardized testing is much hated, high-quality tests like the SAT, ACT, GRE, TOEFL, etc. provide objective measures of ability in a domain. I find that most people want to learn and succeed. There are also people who want rigorous assessment (for example, to apply for school admissions), but this is a lesser need, and is not my focus when building educational products.
Harvard is often described as an “elite” educational institution. There are two ways to be elite: One option involves limiting enrollments, and then even among admitted students, cap the number of people that do well at 20%. I would rather pursue a different path: Set a high bar and teach elite, cutting-edge skills, but strive relentlessly to help everyone succeed. This way, eliteness is defined not by excluding people but by helping as many people as possible to be excellent.
[Original text: The Batch newsletter]
- ›Harvard hạn chế điểm A cho sinh viên độc lập xuống ~20% để chống lạm phát điểm số.
Xây dựng Quy trình Công việc Doanh nghiệp Tái diễn với Quản lý và Nhúng
AI Dev 26 x SF | João Moura: Building Recurring, Governed, and Embedded Enterprise Workflows
- ›Các quy trình công việc AI trong doanh nghiệp cần được thiết kế để tái diễn một cách tự động và đáng tin cậy.
- ›Governance là yếu tố quan trọng để đảm bảo agents hoạt động theo các chính sách và quy tắc của tổ chức.
- ›Nhúng agents vào hệ thống hiện có giúp tạo giá trị ngay lập tức cho doanh nghiệp.
Agent Data Stack—Tại sao Mỗi Agent AI cần có Data Stack Riêng
AI Dev 26 x SF | Luke Kim: The Agent Data Stack—Why Every AI Agent Needs Its Own Data Stack
- ›Các AI agents cần có cơ sở hạ tầng dữ liệu riêng để truy cập, xử lý và lưu trữ thông tin một cách hiệu quả.
- ›Data stack chuyên biệt giúp agents truy cập nhanh chóng dữ liệu cần thiết cho quyết định.
- ›Tách biệt dữ liệu theo agent cải thiện bảo mật, hiệu suất và khả năng quản lý.
VibeML: Xây dựng Mô hình AI trong Vài giờ, không phải Vài tháng
AI Dev 26 x SF | Manos Koukoumidis & Stefan Webb: VibeML: Build your AI model in hours, not months
- ›VibeML cho phép các nhà phát triển tạo mô hình machine learning một cách nhanh chóng mà không cần chuyên môn sâu.
- ›Công cụ này rút ngắn thời gian phát triển từ hàng tháng xuống còn hàng giờ thông qua tự động hóa.
- ›Dân chủ hóa AI modeling giúp nhiều team khác nhau xây dựng các mô hình riêng của họ.
Flower SuperGrid Agents
AI Dev 26 x SF | Daniel Beutel: Flower SuperGrid Agents
- ›Flower SuperGrid cung cấp nền tảng để xây dựng các agents phân tán sử dụng federated learning.
- ›Công nghệ này cho phép nhiều agents hoạt động cùng nhau mà không cần tập trung dữ liệu.
- ›SuperGrid cải thiện khả năng mở rộng và bảo mật cho các hệ thống AI phân tán quy mô lớn.
Tối ưu hóa Độ chính xác, Chi phí và Độ trễ trong Agent Thực tế
AI Dev 26 x SF | Or Dagan: Optimizing Accuracy, Cost, and Latency in Real-World Agents
- ›Khi triển khai AI agents trong sản xuất, cần cân bằng giữa ba yếu tố: độ chính xác, chi phí tính toán và thời gian phản hồi.
- ›Các kỹ thuật tối ưu hóa có thể cải thiện hiệu suất agents mà không tăng chi phí quá mức.
- ›Đo lường và điều chỉnh các trade-off này là chìa khóa để triển khai agent thành công.
Multi-model Pipelines—Cách Đạt kết quả AI tốt hơn với Chi phí thấp hơn
AI Dev 26 x SF | Andrew Filev: Multi Model Pipelines—How to Get Better AI Results for Less
- ›Sử dụng nhiều mô hình AI khác nhau trong một pipeline có thể cải thiện chất lượng kết quả.
- ›Kết hợp các mô hình chuyên biệt cho các tác vụ khác nhau giúp tiết kiệm chi phí so với dùng một mô hình lớn.
- ›Multi-model pipelines cho phép tối ưu hóa hiệu suất cho từng bước xử lý cụ thể.
AI Dev 26 x SF | Diamond Bishop: 100 Đại Lý AI Tiếp Theo - Xây Dựng Văn Phòng Hỗ Trợ Đại Lý
AI Dev 26 x SF | Diamond Bishop: The Next 100 Agents. Building the Agent Native Office
- ›Bài nói về xây dựng các hệ thống văn phòng (office) tương thích với agent AI (agent-native).
- ›Tập trung vào kiến trúc và chiến lược quản lý hàng trăm đại lý AI cùng lúc.
- ›Khám phá cách tích hợp agent AI vào qui trình làm việc doanh nghiệp hiện đại.
AI Dev 26 x SF | Paul Everitt: Chuyển Dịch Sang Kỹ Thuật Agentic Engineering
AI Dev 26 x SF | Paul Everitt: The Shift to Agentic Engineering
- ›Phát triển phần mềm đang chuyển từ mô hình truyền thống sang agentic engineering.
- ›Đại lý AI sẽ trở thành các thành phần cơ bản trong kiến trúc ứng dụng tương lai.
- ›Cần thay đổi cách tiếp cận trong thiết kế, phát triển và kiểm thử để phù hợp với mô hình agent.
AI Dev 26 x SF | Andrew K. Davies: Bộ Nhớ Xác Định - Cách Xây Dựng AI Không Thể Nói Dối
AI Dev 26 x SF | Andrew K. Davies: Deterministic Memory: How to Build an AI That Cannot Lie
- ›Giới thiệu phương pháp deterministic memory để nâng cao độ tin cậy của hệ thống AI.
- ›Giải pháp này đảm bảo AI cung cấp thông tin nhất quán và chính xác mà không có mâu thuẫn.
- ›Ứng dụng trong việc xây dựng các đại lý AI đáng tin cậy cho các tác vụ quan trọng.
AI Dev 26 x SF | Thierry Damiba: Phát Hiện Bất Thường Trong Video từ Edge đến Cloud
AI Dev 26 x SF | Thierry Damiba: Edge to Cloud Video Anomaly Detection
- ›Sử dụng kỹ thuật xử lý video từ các thiết bị edge (cục bộ) đến cloud để phát hiện bất thường.
- ›Kết hợp tính toán tại edge và cloud tối ưu hóa hiệu suất, độ trễ và chi phí.
- ›Ứng dụng trong giám sát an ninh, phát hiện đe dọa và các hệ thống phân tích video real-time.
AI Dev 26 x SF | Brandon Waselnuk: Xây Dựng Công Cụ Context Mà Các Đại Lý AI Cần
AI Dev 26 x SF | Brandon Waselnuk: Building the Context Engine AI Agents Need
- ›Giới thiệu khái niệm context engine - thành phần giúp đại lý AI hiểu và xử lý thông tin ngữ cảnh.
- ›Context engine cho phép agent AI duy trì hiểu biết về tình huống, người dùng và môi trường xung quanh.
- ›Là nền tảng để xây dựng các đại lý AI thông minh, thích ứng cao và ra quyết định tốt hơn.
Khóa học: Xây dựng agent AI sinh hình ảnh và video với đánh giá tự động
New course: Build AI agents that generate images and videos -- an under-explored frontier. A key to performance is having the agent evaluate its own output, and iterate to improve quality. This short course is built together with @googlecloudtech and taught by Katie Nguyen and Wafae Bakkali.
You'll learn three evaluation techniques and combine them in an agent: image-text similarity scoring to check the output matches the prompt, an LLM judge that scores against custom criteria like brand consistency, and structured rubrics that break a prompt into verifiable yes/no questions like "is the subject in the frame?" and "does the camera motion match?"
Skills you'll gain:
- Learn image and video prompt engineering
- Build an image agent that turns brand guidelines into UI mockups
- Build a video agent that plans multi-scene explainers and animates reference frames with synchronized audio
Join and build agents that create images and video!
https://www.deeplearning.ai/courses/ai-agents-for-image-and-video-generation
- ›Khóa học dạy xây dựng agent AI sinh hình ảnh và video, một lĩnh vực chưa được khai phá sâu.
Khóa học: Transformer thực hành - Hiểu cách LLM hoạt động để tối ưu triển khai
New course: Transformers in Practice. You'll get a practical view of how transformer-based LLMs work, so you can reason about their behavior, diagnose problems like slow inference, and make smarter decisions about deployment. This course is built in partnership with @AMD and taught by @realSharonZhou.
You'll see how transformers generate text one token at a time, how the model decides which earlier words matter most when predicting the next one, and how techniques like quantization speed up inference on GPUs. This is not a video-only course; interactive visualizations throughout let you play with these concepts and build intuition that sticks.
Skills you'll gain:
- Understand why LLMs hallucinate, and RAG and chain-of-thought shape what they generate
- Look inside the model to see how attention and layers combine to predict the next token
- Diagnose inference bottlenecks and learn the techniques that speed up transformers on GPUs
Join and understand what's really happening inside your LLMs: https://www.deeplearning.ai/courses/transformers-in-practice
- ›Cung cấp cái nhìn thực hành về transformer-based LLM để suy luận hành vi và chẩn đoán vấn đề inference.
Không có thảm họa việc làm do AI
There will be no AI jobpocalypse.
The story that AI will lead to massive unemployment is stoking unnecessary fear. AI — like any other technology — does affect jobs, but telling overblown stories of large-scale unemployment is irresponsible and damaging. Let’s put a stop to it.
I’ve expressed skepticism about the jobpocalypse in previous posts. I’m glad to see that the popular press is now pushing back on this narrative. The image below features some recent headlines.
Software engineering is the sector most affected by AI tools, as coding agents race ahead. Yet hiring of software engineers remains strong! So while there are examples of AI taking away jobs, the trends strongly suggest the net job creation is vastly greater than the job destruction — just like earlier waves of technology. Further, despite all the exciting progress in AI, the U.S. unemployment rate remains a healthy 4.3%.
Why is the AI jobpocalypse narrative so popular? For one thing, frontier AI labs have a strong incentive to tell stories that make AI technology sound more powerful. At their most extreme, they promote science-fiction scenarios of AI “taking over” and causing human extinction. If a technology can replace many employees, surely that technology must be very valuable!
Also, a lot of SaaS software companies charge around $100-$1000 per user/year. But if an AI company can replace an employee who makes $100,000 — or make them 50% more productive — then charging even $10,000 starts to look reasonable. By anchoring not to typical SaaS prices but to salaries of employees, AI companies can charge a lot more.
Additionally, businesses have a strong incentive to talk about layoffs as if they were caused by AI. After all, talking about how they’re using AI to be far more productive with fewer staff makes them look smart. This is a better message than admitting they overhired during the pandemic when capital was abundant due to low interest rates and a massive government financial stimulus.
To be clear, I recognize that AI is causing a lot of people’s work to change. This is hard. This is stressful. (And to some, it can be fun.) I empathize with everyone affected. At the same time, this is very different from predicting a collapse of the job market.
Societies are capable of telling themselves stories for years that have little basis in reality and lead to poor society-wide decision making. For example, fears over nuclear plant safety led to under-investment in nuclear power. Fears of the “population bomb” in the 1960s led countries to implement harsh policies to reduce their populations. And worries about dietary fat led governments to promote unhealthy high-sugar diets for decades.
Now that mainstream media is openly skeptical about the jobpocalypse, I hope these stories will start to lose their teeth (much like fears of AI-driven human extinction have).
Contrary to the predictions of an AI jobpocalypse, I predict the opposite: There will be an AI jobapalooza! AI will lead to a lot more good AI engineering jobs, and I’m also optimistic about the future of the overall job market. What AI engineers do will be different from traditional software engineering, and many of these jobs will be in businesses other than traditional large employers of developers. In non-AI roles, too, the skills needed will change because of AI. That makes this a good time to encourage more people to become proficient in AI, and make sure they’re ready for the different but plentiful jobs of the future!
[Original text in The Batch newsletter.]
- ›Câu chuyện AI gây mất việc làm hàng loạt là quá thổi phồng - tạo công việc ròng lớn hơn mất công việc, giống như các sóng công nghệ trước đây.
Coursera và Udemy hợp nhất thành một công ty duy nhất
I'm delighted that @coursera and @udemy have come together as one company to serve learners.
Both Coursera and Udemy were founded with the belief that access to high-quality education changes lives. Over the years, both companies have advanced this goal, creating opportunities for individuals, organizations, and communities around the world.
That role is even more important now, as AI is changing the nature of work and increasing the need for continuous learning. Helping people build job-relevant skills will be critical to how we create a better world.
By combining the strengths of both companies, we can better serve this need. We bring together a broader range of learning content, trusted instructors and educators, and engaging learning experiences. This creates new opportunities to make learning more personalized, more applied, and more accessible at scale.
I’m excited to serve as Chairman of the combined company, working alongside Greg Hart and the leadership team. There is a strong foundation in both organizations, and I look forward to what the teams will build together to expand access opportunity globally.
Learn more: http://blog.coursera.org/coursera-and-udemy-are-now-one-company-creating-the-worlds-most-comprehensive-skills-platform/
- ›Coursera và Udemy hợp nhất để tạo nền tảng kỹ năng toàn diện nhất, kết hợp nội dung học rộng, hướng dẫn đáng tin cậy, và trải nghiệm hấp dẫn.
Khóa học mới: Xây dựng AI agent với UI hình thành động
New course: Build agents that respond to users with not only plaintext, but custom UIs like charts, forms, and whiteboards, generated on demand and displayed right in the chat. This short course is built in partnership with @CopilotKit and taught by @ataiiam, co-founder of CopilotKit.
You'll learn three approaches: Your agent can pick from custom components you build, like charts and forms. It can compose new layouts from a set of building blocks you provide, like rows, cards, and text. Or it can incorporate existing third-party apps, like a whiteboard or a calendar, right inside the conversation.
Skills you’ll gain:
- Build agents that render custom components like charts and forms on demand
- Build an app where the agent and user collaborate on shared data, beyond just the chat window
- Place third-party apps like maps, calendars, and whiteboards right in your interface
Join and build agents that give users something to see and act on! https://www.deeplearning.ai/short-courses/build-interactive-agents-with-generative-ui/
- ›Khóa học dạy xây dựng agent hiển thị UI tùy chỉnh như biểu đồ, biểu mẫu, bảng trắng được tạo nhanh và hiển thị trực tiếp trong chat.
Coding agent tăng tốc độ các loại công việc phần mềm theo mức độ khác nhau
Coding agents are accelerating different types of software work to different degrees. When we architect teams, understanding these distinctions helps us to have realistic expectations. Listing functions from most accelerated to least, my order is: frontend development, backend, infrastructure, and research.
Frontend development — say, building a web page to serve descriptions of products for an ecommerce site — is dramatically sped up because coding agents are fluent in popular frontend languages like TypeScript and JavaScript and frameworks like React and Angular. Additionally, by examining what they have built by operating a web browser, coding agents are now very good at closing the loop and iterating on their own implementations. Granted, LLMs today are still weak at visual design, but given a design (or if a polished design isn’t important), the implementation is fast!
Backend development — say, building APIs to respond to queries requesting product data — is harder. It takes more work by human developers to steer modern models to think through corner cases that might lead to subtle bugs or security flaws. Further, a backend bug can lead to non-intuitive downstream effects like a corrupted database that occasionally returns incorrect results, which can be harder to debug than a typical frontend bug. Finally, although database migrations can be easier with coding agents, they’re still hard and need to be handled carefully to prevent data loss. While backend development is much faster with coding agents, they accelerate it less, and skilled developers still design and implement far better backends than inexperienced ones who use coding agents.
Infrastructure. Agents are even less effective in tasks like scaling an ecommerce site to 10K active uses while maintaining 99.99% reliability. LLMs' knowledge is still relatively limited with respect to infrastructure and the complex tradeoffs good engineers must make, so I rarely trust them for critical infra decisions. Building good infrastructure often requires a period of testing and experimentation, and coding agents can help with that, but ultimately that’s a significant bottleneck where fast AI coding does not help much. Lastly, finding infrastructure bugs — say, a subtle network misconfiguration — can be incredibly difficult and requires deep engineering expertise. Thus, I’ve found that coding agents accelerate critical infrastructure even less than backend development.
Research. Coding agents accelerate research work even less. Research involves thinking through new ideas, formulating hypotheses, running experiments, interpreting them to potentially modify the hypotheses, and iterating until we reach conclusions. Coding agents can speed up the pace at which we can write research code. (I also use coding agents to help me orchestrate and keep track of experiments, which makes it easier for a single researcher to manage more experiments.) But there is a lot of work in research other than coding, and today’s agents help with research only marginally.
Categorizing software work into frontend, backend, infra, and research is an extreme simplification, but having a simple mental model for how much different tasks have sped up has been useful for how I organize software teams. For example, I now ask front-end teams to implement products dramatically faster than a year ago, but my expectations for research teams have not shifted nearly as much.
I am fascinated by how to organize software teams to use coding agents to achieve speed, and will keep sharing my findings in future posts.
[Original text: https://www.deeplearning.ai/the-batch/issue-350/ ]
- ›Front-end được tăng tốc độ rõ rệt nhất vì coding agent thành thạo TypeScript, JavaScript, React, Angular và lặp lại qua trình duyệt.
Cách prompt AI năm 2026 rất khác so với năm 2022 khi ChatGPT ra mắt
Pinned: How we prompt AI is very different in 2026 than 2022 when ChatGPT came out.
I'm teaching a new course, AI Prompting for Everyone, to help you become an AI power user — whatever your current skill level.
It covers skills that apply across ChatGPT, Gemini, Claude, and other AI tools. How to use deep research mode for well-researched reports on complex questions. How to give AI the right context, including more documents and images than most people realize you can provide. When to ask AI to think hard for several minutes on important decisions like what car to buy, what to study, or what job to take. And how to use AI to generate images, analyze data, and build simple games and websites.
I also cover intuitions about how these models work under the hood, so you know when to trust an answer and when not to.
Along the way, you'll see flying squirrels, a creativity test, some of my old family photos, and fireworks.
Join me at http://deeplearning.ai/courses/ai-prompting-for-everyone
- ›Prompt AI thay đổi rất nhiều từ 2022-2026 - khóa 'AI Prompting for Everyone' giúp trở thành power user trên ChatGPT, Gemini, Claude, v.v.
Đội phần mềm sinh ra cho AI hoạt động rất khác so với đội truyền thống
AI-native software engineering teams operate very differently than traditional teams. The obvious difference is that AI-native teams use coding agents to build products much faster, but this leads to many other changes in how we operate. For example, some great engineers now play broader roles than just writing code. They are partly product managers, designers, sometimes marketers. Further, small teams who work in the same office, where they can communicate face-to-face, can move incredibly quickly.
Because we can now build fast, a greater fraction of time must be spent deciding what to build. To deal with this project-management bottleneck, some teams are pushing engineer:product manager (PM) some teams are pushing engineer:product manager (PM) ratios downward from, say, 8:1 to as low as 1:1. But we can do even better: If we have one PM who decides what to build and one engineer who builds it, the communication between them becomes a bottleneck. This is why the fastest-moving teams I see tend to have engineers who know how to do some product work (and, optionally, some PMs who know how to do some engineering work). When an engineer understands users and can make decisions on what to build and build it directly, they can execute incredibly quickly.
I’ve seen engineers successfully expand their roles to including making product decisions, and PMs expand their roles to building software. The tech industry has more engineers than PMs, but both are promising paths. If you are an engineer, you’ll find it useful to learn some product management skills, and if you’re a PM, please learn to build!
Looking beyond the product-management bottleneck, I also see bottlenecks in design, marketing, legal compliance, and much more. When we speed up coding 10x or 100x, everything else becomes slow in comparison. For example, some of my teams have built great features so quickly that the marketing organization was left scrambling to figure out how to communicate them to users — a marketing bottleneck. Or when a team can build software in a day that the legal department needs a week to review, that’s a legal compliance bottleneck. In this way, agentic coding isn’t just changing the workflow of software engineering, it’s also changing all the teams around it.
When smaller, AI-enabled teams can get more done, generalists excel. Traditional companies need to pull together people from many specialties — engineering, product management, design, marketing, legal, etc. — to execute projects and create value. This has resulted in large teams of specialists who work together. But if a team of 2 persons is to get work done that require 5 different specialities, then some of those individuals must play roles outside a single speciality. In some small teams, individuals do have deep specializations. For example, one might be a great engineer and another a great PM. But they also understand the other key functions needed to move a project forward, and can jump into thinking through other kinds of problems as needed. Of course, proficiency with AI tools is a big help, since it helps us to think through problems that involve different roles.
Even in a two-person team, to move fast, communication bottlenecks also must be minimized. This is why I value teams that work in the same location. Remote teams can perform well too, but the highest speed is achieved by having everyone in the room, able to communicate instantaneously to solve problems.
This post focuses on AI-native teams with around 2-10 persons, but not everything can be done by a small team. I'll address the coordination of larger teams in the future.
I realize these shifts to job roles are tough to navigate for many people. At the same time, I am encouraged that individuals and small teams who are willing to learn the relevant skills are now able to get far more done than was possible before. This is the golden age of learning and building!
[Original text: https://www.deeplearning.ai/the-batch/issue-349/ ]
- ›Đội phần mềm sinh ra cho AI dùng coding agent xây dựng sản phẩm nhanh hơn nhiều, dẫn đến kỹ sư đóng vai trò rộng (engineer, PM, designer, marketer).
Khóa học mới: Phát triển dựa Spec với Coding Agent, hợp tác JetBrains
New course: Spec-Driven Development with Coding Agents, built in partnership with @jetbrains, and taught by @paulweveritt.
Vibe coding is fast, but often produces code that doesn't match what you asked for. This short course teaches you spec-driven development: write a detailed spec defining what to build, and work with your coding agent to implement it. Many of the best developers already build this way.
A spec lets you control large code changes with a few words, preserve context across agent sessions, and stay in control as your project grows in complexity.
Skills you'll gain:
- Write a detailed specification to define your mission, tech stack, and roadmap, giving your agent the context it needs from the start
- Plan, implement, and validate features in iterative loops using a spec as your agent's guide
- Apply the same repeatable workflow to both new and legacy codebases
- Package your workflow into a portable agent skill that works across agents and IDEs
Join and write specs that keep your coding agent on track!
https://www.deeplearning.ai/short-courses/spec-driven-development
- ›Spec-driven development giúp kiểm soát code changes lớn bằng vài từ và bảo tồn context giữa các agent sessions.
Voice UI kết hợp giọng nói và cập nhật hình ảnh trong thời gian thực
I'm excited about voice as a UI layer for existing visual applications — where speech and screen update together. This goes well beyond voice-only use cases like call center automation.
The barrier has been a hard technical tradeoff: low-latency voice models lack reliability, while agentic pipelines (speech-to-text → LLM → text-to-speech) are intelligent but too slow for conversation. Ashwyn Sharma and team at Vocal Bridge (an AI Fund portfolio company) address this with a dual-agent architecture: a foreground agent for real-time conversation, a background agent for reasoning, guardrails, and tool calls.
I used Vocal Bridge to add voice to a math-quiz app I'd built for my daughter; this took less than an hour with Claude Code. She speaks her answers, the app responds verbally and updates the questions and animations on screen.
Only a tiny fraction of developers have ever built a voice app. If you'd like to try building one, check out Vocal Bridge for free: https://vocalbridgeai.com
- ›Voice UI vượt xa voice-only use case, cho phép speech và screen update cùng nhau.
Tương lai của kỹ thuật phần mềm khi AI agent tăng tốc phát triển code
As AI agents accelerate coding, what is the future of software engineering? Some trends are clear, such as the Product Management Bottleneck, referring to the idea that we are more constrained by deciding what to build rather than the actual building. But many implications, like AI’s impact on the job market, how software teams will be organized, and more, are still being sorted out.
The theme of our AI Developer Conference on April 28-29 in San Francisco is The Future of Software Engineering. I look forward to speaking about this topic there, hearing from other speakers on this theme, and chatting with attendees about it. We’re shaping the future, and I hope you will join me there!
It is currently trendy in some technology and policy circles to forecast massive job losses due to AI. Even if they have not yet materialized, these losses certainly must be just over the horizon! I have a contrarian view that the AI jobpocalypse — the notion that AI will lead to massive unemployment, perhaps even rioting in the streets — won’t be nearly as bad as dire forecasts by pundits, especially pundits who are trying to paint a picture of how powerful their AI technology is.
Among professions, AI is accelerating software engineering most, given the rise of coding agents. According to a new report by Citadel Research, software engineering job postings are rising rapidly. So if software engineering is a harbinger of the impact AI will have on other professions, this expansion of software engineering jobs is encouraging.
Yes, fresh college graduates are having a hard time finding jobs. And yes, there have been layoffs that CEOs have attributed to AI, even if a large fraction of this was “AI washing,” where businesses choose to attribute layoffs to AI, even though AI has not changed their internal operations much yet. And yes, there is a subset of job roles, such as call center operator, that are more heavily impacted. Many people are feeling significant job insecurity, and I feel for everyone struggling with employment, whether or not the cause is AI-related. And many other factors, such as over-hiring during the pandemic and high interest rates, have contributed to the slowdown in the labor market, and the notion that AI is leading to unemployment is oversimplified.
In software engineering, I see a lot of exciting work ahead to adapt our workflows. It is already clear that: (i) As AI makes coding easier, a lot more people will be doing it. (ii) Writing code by hand and even reading (generated) code is not that important, because we can ask an LLM about the code and operate at a higher level than the raw syntax (although how high we can or should go is rapidly changing). (iii) There will be a lot more custom applications, because now it’s economical to write software for smaller and smaller audiences. (iv) Deciding what to build, more than the actual building, is becoming a bottleneck. (v) The cost of paying down technical debt is decreasing (since AI can refactor for you).
At the same time, there are also a lot of open questions for our profession, such as:
- In the future, what will be the key skills of a senior software engineer? And for junior levels, what should be the new Computer Science curriculum?
- If everyone can build features, what skills, strategies, or resources create competitive advantage for individuals and for businesses?
- What are the new building blocks (libraries, SDKs, etc.) of software? How do we organize coding agents to create software?
- What should a software team look like? For example, how many engineers, product managers, designers, and so on. What tooling do we need to manage their workflow?
- How do AI agents change the workflow of machine learning engineers and data scientists? For example, how can we use agents to accelerate exploring data, identifying hypotheses, and testing them?
I’m excited to explore these and other questions about the future of software engineering at AI Dev. I expect this to be an exciting event. Please join us!
[Original text: The Batch newsletter.]
https://ai-dev.deeplearning.ai/
- ›Product Management Bottleneck: vấn đề chính là quyết định xây gì, chứ không phải xây cách nào.
Khóa học: Suy luận hiệu quả với SGLang cho sinh text và hình ảnh
New course: Efficient Inference with SGLang: Text and Image Generation, built in partnership with LMSys @lmsysorg and RadixArk @radixark, and taught by Richard Chen @richardczl, a Member of Technical Staff at RadixArk.
Running LLMs in production is expensive, and much of that cost comes from redundant computation. This short course teaches you to eliminate that waste using SGLang, an open-source inference framework that caches computation already done and reuses it across future requests.
When ten users share the same system prompt, SGLang processes it once, not ten times. The speedups compound quickly, especially when there's a lot of shared context across requests.
Skills you'll gain:
- Implement a KV cache from scratch to eliminate redundant computation within a single request
- Scale caching across users and requests with RadixAttention, so shared context is only processed once
- Accelerate image generation with diffusion models using SGLang's caching and multi-GPU parallelism
Join and learn to make LLM inference faster and more cost-efficient at scale!
https://www.deeplearning.ai/short-courses/efficient-inference-with-sglang-text-and-image-generation
- ›SGLang framework giảm chi phí inference bằng caching computation và tái sử dụng qua requests.