#vision-based-systems

[ follow ]
Artificial intelligence
fromWIRED
12 hours ago

They Built the 'Cursor for Hardware.' Now, Anthropic Wants In

Creativity in hardware development is being unlocked by new tools and APIs, enabling more makers and developers to innovate.
fromFuturism
1 day ago

Robot Dogs Patrolling Precious Crops as Food Crisis Deepens

Bayer is supplementing human security patrols around its 8,000 acre Hawaiian corn farm with robotic security dogs, supplied by the tech firm Asylon. The Asylon dogs are meant to guard the company's precious maize from vandals, wildfires, wild fauna, and other hazards around the clock.
Agriculture
fromFast Company
1 day ago

How AI and education are shaping the future of aesthetics

Aesthetic inspiration is social and collective, but aesthetic results are deeply personal. What works for one face, skin type, or bone structure won't always work for another.
Healthcare
#robotics
Startup companies
fromTechCrunch
2 days ago

This simulation startup wants to be the Cursor for physical AI | TechCrunch

Engineers aim to program physical agents like digital ones, but robotics faces challenges due to limited data and the need for realistic simulations.
Artificial intelligence
fromTechCrunch
2 days ago

Physical Intelligence, a hot robotics startup, says its new robot brain can figure out tasks it was never taught | TechCrunch

Physical Intelligence's π0.7 model enables robots to perform unfamiliar tasks through compositional generalization, marking a significant advancement in robotic AI capabilities.
Artificial intelligence
fromArs Technica
3 days ago

Robot dogs now read gauges and thermometers using Google Gemini

Robots can now accurately read analog instruments thanks to Google DeepMind's Gemini Robotics-ER 1.6 model, enhancing their embodied reasoning capabilities.
Science
fromNature
2 weeks ago

Inside the 'self-driving' lab revolution

Eve, an AI-powered robotic platform, automates early-stage drug design, significantly enhancing efficiency in scientific research.
Startup companies
fromTechCrunch
2 days ago

This simulation startup wants to be the Cursor for physical AI | TechCrunch

Engineers aim to program physical agents like digital ones, but robotics faces challenges due to limited data and the need for realistic simulations.
Artificial intelligence
fromTechCrunch
2 days ago

Physical Intelligence, a hot robotics startup, says its new robot brain can figure out tasks it was never taught | TechCrunch

Physical Intelligence's π0.7 model enables robots to perform unfamiliar tasks through compositional generalization, marking a significant advancement in robotic AI capabilities.
Artificial intelligence
fromArs Technica
3 days ago

Robot dogs now read gauges and thermometers using Google Gemini

Robots can now accurately read analog instruments thanks to Google DeepMind's Gemini Robotics-ER 1.6 model, enhancing their embodied reasoning capabilities.
Science
fromNature
2 weeks ago

Inside the 'self-driving' lab revolution

Eve, an AI-powered robotic platform, automates early-stage drug design, significantly enhancing efficiency in scientific research.
#ai
Python
fromPycon
1 week ago

Python and the Future of AI: Agents, Inference, and Edge AI

AI tools are increasingly integrated into development, with a dedicated track at PyCon US focusing on their future and practical applications.
London startup
fromwww.bbc.com
1 day ago

Could a digital twin make you into a 'superworker'?

Digital Richard is an AI twin that assists Richard Skellett in business and personal decision-making, serving as a model for digital twins at Bloor Research.
Mobile UX
fromGSMArena.com
4 days ago

The Honor 600 series will bring the much-improved AI Image to Video 2.0 feature

Honor's upcoming 600 series features AI Image to Video 2.0, enabling users to create videos from images and text commands.
Cars
fromStreetsblog USA
5 days ago

Can This Tool Predict Where Your City's Next Car Crash Will Happen? - Streetsblog USA

A new AI tool, StreetVision, predicts traffic crash hotspots to enhance road safety and inform preventive measures.
Python
fromPycon
1 week ago

Python and the Future of AI: Agents, Inference, and Edge AI

AI tools are increasingly integrated into development, with a dedicated track at PyCon US focusing on their future and practical applications.
#virtual-reality
Growth hacking
fromInfoQ
2 days ago

From VR to Flat Screens: Bridging the Input and Immersion Gap

The future of VR and wearables is focused on natural interaction and immersion, despite current market limitations.
Psychology
fromPsychology Today
2 days ago

The Science of Seeing Differently Through Virtual Reality

Virtual reality can immerse individuals in experiences of bias, but it may also reinforce existing prejudices if not carefully designed.
Growth hacking
fromInfoQ
2 days ago

From VR to Flat Screens: Bridging the Input and Immersion Gap

The future of VR and wearables is focused on natural interaction and immersion, despite current market limitations.
Psychology
fromPsychology Today
2 days ago

The Science of Seeing Differently Through Virtual Reality

Virtual reality can immerse individuals in experiences of bias, but it may also reinforce existing prejudices if not carefully designed.
Software development
fromInfoWorld
3 days ago

Mastering the dull reality of sexy AI

The gap in enterprise AI lies in building effective systems for retrieval, evaluation, memory, and governance, not just access to models.
#ai-in-education
fromPsychology Today
2 days ago
Education

Artificial Intelligence in Education Needs Design, Not Devotion

AI's impact on education varies based on its integration into the curriculum, influencing both performance and the depth of learning.
Online learning
fromeLearning Industry
4 days ago

Rethinking Education With AI: Create More Engaging Learning Experiences With AI-Powered Learning Design

AI can enhance learning design by personalizing experiences and improving relevance, but risks of generic content and diminished critical thinking remain.
Education
fromPsychology Today
2 days ago

Artificial Intelligence in Education Needs Design, Not Devotion

AI's impact on education varies based on its integration into the curriculum, influencing both performance and the depth of learning.
Online learning
fromeLearning Industry
4 days ago

Rethinking Education With AI: Create More Engaging Learning Experiences With AI-Powered Learning Design

AI can enhance learning design by personalizing experiences and improving relevance, but risks of generic content and diminished critical thinking remain.
Online learning
fromeLearning
2 days ago

The Role of AI in Modern Technical Training Programs Across Industries - eLearning

Technical training is essential for applying skills in real work situations, enhanced by AI for better design and delivery.
#agentic-ai
Software development
fromTechCrunch
3 days ago

OpenAI updates its Agents SDK to help enterprises build safer, more capable agents | TechCrunch

OpenAI's updated SDK enhances agent development with sandboxing and in-distribution harness features for safer, more complex automated tasks.
Software development
fromTechCrunch
3 days ago

OpenAI updates its Agents SDK to help enterprises build safer, more capable agents | TechCrunch

OpenAI's updated SDK enhances agent development with sandboxing and in-distribution harness features for safer, more complex automated tasks.
European startups
fromTechCrunch
3 days ago

Chipmakers AMD, Arm, and Qualcomm are all investing in this buzzy self-driving tech startup | TechCrunch

Chipmakers AMD, Arm, and Qualcomm invested $60 million in Wayve's self-driving technology, enhancing its $1.2 billion Series D funding round.
fromwww.npr.org
4 days ago

In the brain, objects seen and imagined follow the same neural path

"I can look at an object in the world around me, but I can also close my eyes and imagine the object," says Varun Wadia, highlighting the dual capability of visual perception and imagination.
Science
#humanoid-robots
Pets
fromFuturism
5 days ago

Video Shows Humanoid Robot Chasing a Pack of Wild Boars

Humanoid robots are entertaining the public through various stunts, including chasing wild boars and performing in public events.
Artificial intelligence
fromComputerworld
3 weeks ago

How digital brains for humanoid robots are being built

Humanoid robots have significantly improved in functionality and behavior over the past year, exemplified by Olaf's performance at Nvidia's GTC event.
Pets
fromFuturism
5 days ago

Video Shows Humanoid Robot Chasing a Pack of Wild Boars

Humanoid robots are entertaining the public through various stunts, including chasing wild boars and performing in public events.
Artificial intelligence
fromComputerworld
3 weeks ago

How digital brains for humanoid robots are being built

Humanoid robots have significantly improved in functionality and behavior over the past year, exemplified by Olaf's performance at Nvidia's GTC event.
Graphic design
fromGeeky Gadgets
5 days ago

Inside Seedance 2.0 : The Next Big Shift in AI Video Generation

Seedance 2.0 enhances AI-driven video production with lifelike avatars, precise sound synchronization, and customizable features for diverse industries.
Marketing tech
fromForbes
5 days ago

How AI Interfaces Are Reshaping Discovery, Trust And Decision Making

The traditional home page is losing its significance as AI assistants reshape how users interact with brands online.
#apple
Apple
fromTNW | Apple
5 days ago

Apple testing four frame designs for AI smart glasses ahead of 2027 launch

Apple is testing four frame styles for AI-powered smart glasses, targeting production in December 2026 and a launch in spring or summer 2027.
Apple
fromTNW | Apple
5 days ago

Apple testing four frame designs for AI smart glasses ahead of 2027 launch

Apple is testing four frame styles for AI-powered smart glasses, targeting production in December 2026 and a launch in spring or summer 2027.
Productivity
fromPerevillega
3 weeks ago

Building Agent Memory That Survives Between Sessions | Pere Villega

Memory in Claude Code sessions is a design problem requiring deliberate creation of context to avoid repetitive explanations.
New York City
fromInsideHook
6 days ago

Can Self-Driving Cars Help Fix the Nation's Potholes?

Pothole repairs are gaining attention, with NYC filling 100,000 potholes and Waymo partnering with Waze to identify potholes using autonomous vehicles.
#ai-agents
Software development
fromTechzine Global
2 days ago

OpenAI's new Agents SDK focuses on safety and scalability

OpenAI's updated Agents SDK enables developers to create autonomous AI agents for complex tasks with enhanced usability and a sandbox environment.
Software development
fromTechzine Global
2 days ago

OpenAI's new Agents SDK focuses on safety and scalability

OpenAI's updated Agents SDK enables developers to create autonomous AI agents for complex tasks with enhanced usability and a sandbox environment.
Education
fromFast Company
3 days ago

The future of AI in schools isn't personalized learning

Personalized learning through AI often results in device-mediated instruction, lacking the essential role of teachers in student development.
fromComputerWeekly.com
5 days ago

Qualcomm expands strategic advanced driver assistance systems, immersive eyewear collaborations | Computer Weekly

Qualcomm is helping address one of the auto industry's most pressing needs - scaling intelligent vehicle technology to meet growing consumer demand for vehicles that are automated, connected and highly personalised.
European startups
#artificial-intelligence
Artificial intelligence
fromTechCrunch
6 days ago

From LLMs to hallucinations, here's a simple guide to common AI terms | TechCrunch

A glossary of key artificial intelligence terms is essential for understanding the complex language used in the industry.
Science
fromNature
5 days ago

Human scientists trounce the best AI agents on complex tasks

The number of natural science publications mentioning AI grew nearly 30-fold from 2010 to 2025, indicating rapid adoption by scientists.
Python
fromBusiness Matters
3 weeks ago

Building AI-powered visual solutions: How Python forms the foundation for advanced Computer Vision use cases

Python is the preferred programming language for developing computer vision technologies due to its simplicity, flexibility, and extensive libraries.
Artificial intelligence
fromTechCrunch
6 days ago

From LLMs to hallucinations, here's a simple guide to common AI terms | TechCrunch

A glossary of key artificial intelligence terms is essential for understanding the complex language used in the industry.
Data science
fromInfoWorld
2 weeks ago

Why 'curate first, annotate smarter' is reshaping computer vision development

Strategic data selection and curation reduce annotation costs and enhance development productivity in computer vision teams.
Python
fromMathspp
1 week ago

uv skills for coding agents

Utilizing uv workflows enhances Python code execution and script management for coding agents, ensuring proper handling of dependencies and sandboxing.
Roam Research
fromFast Company
2 weeks ago

How AI-powered echolocation is giving small drones night vision

An ultrasound-based perception system inspired by bat echolocation enables small aerial robots to navigate in low-visibility environments.
#autonomous-vehicles
DevOps
fromInfoQ
2 weeks ago

Optimization in Automated Driving: From Complexity to Real-Time Engineering

A production-grade AV stack is a distributed dataflow graph of components, optimized for resource management and real-time constraints.
Mission District
fromMedium
1 month ago

What is teleoperation?

Autonomous vehicles require invisible design infrastructure beyond sensors and algorithms to handle real-world complexity and edge cases at scale.
DevOps
fromInfoQ
2 weeks ago

Optimization in Automated Driving: From Complexity to Real-Time Engineering

A production-grade AV stack is a distributed dataflow graph of components, optimized for resource management and real-time constraints.
Mission District
fromMedium
1 month ago

What is teleoperation?

Autonomous vehicles require invisible design infrastructure beyond sensors and algorithms to handle real-world complexity and edge cases at scale.
Software development
fromTechCrunch
5 days ago

Microsoft is working on yet another OpenClaw-like agent | TechCrunch

Microsoft is testing OpenClaw-like features for its Microsoft 365 Copilot tool aimed at enterprise customers with enhanced security controls.
Artificial intelligence
fromTheregister
3 days ago

LLMs fail in 8 out of 10 early differential diagnosis cases

AI models fail at early differential diagnosis in over 80% of cases, highlighting significant limitations for patient self-diagnosis.
Software development
fromMedium
6 days ago

GAIA by AMD - Running Intelligent Systems Fully on Your Own Machine

GAIA is an open-source framework enabling local execution of intelligent agents, eliminating external dependencies and enhancing data control.
fromBusiness Insider
3 weeks ago

Why fully self-driving cars are almost impossible

Despite significant investments and technological advancements, the reality is that no vehicle currently operating on public roads can be classified as fully autonomous. The complexities of real-world driving conditions present insurmountable challenges.
Cars
Artificial intelligence
fromMedium
6 days ago

Why Your AI System Is Open-Loop

Open-loop AI systems audit spending after the fact, while closed-loop systems proactively control costs through continuous measurement and adjustment.
fromTechCrunch
1 month ago

Memories.ai is building the visual memory layer for wearables and robotics | TechCrunch

AI is already doing really well in the digital world, what about the physical world? AI wearables, robotics need memories as well. ... Ultimately, you need AI to have visual memories. We believe in that future.
Wearables
Marketing tech
fromTNW | Microsoft
4 weeks ago

Microsoft's MAI-Image-2 enters the top three AI image generators

Microsoft's MAI-Image-2 ranks third on Arena.ai's image generation leaderboard, behind only Google and OpenAI, and is now rolling out across Copilot and Bing Image Creator.
fromGreaterwrong
1 week ago
Artificial intelligence

My picture of the present in AI

AI companies are experiencing significant productivity increases through the integration of advanced AI tools, achieving a speed-up of around 1.6x.
Marketing tech
fromeLearning Industry
1 month ago

D-ID Launches V4 Expressive Visual Agents For Real-Time AI Interaction

D-ID launches V4 Expressive Visual Agents, ultra-high-fidelity AI avatars enabling real-time LLM conversations and enterprise video content with sub-0.5-second latency and 4K resolution.
Artificial intelligence
fromTheregister
2 weeks ago

Microsoft shivs OpenAI with new AI models for speech, images

Microsoft launched public preview versions of machine learning models for speech recognition, speech synthesis, and image generation, competing directly with OpenAI.
Artificial intelligence
fromFortune
2 weeks ago

Is AI's visual understanding mostly a 'mirage'? New research suggests so. | Fortune

Anthropic faces significant cybersecurity risks following multiple sensitive data leaks related to its new AI model, Mythos.
Tech industry
fromTechCrunch
2 months ago

Lidar-maker Ouster buys vision company StereoLabs as sensor consolidation continues | TechCrunch

Ouster acquired StereoLabs for $35 million and 1.8 million shares to integrate vision-based perception with lidar and build a unified sensing platform.
Gadgets
fromFuturism
2 months ago

This Robot With a Working Human Face Is Incredibly Unsettling

DroidUP unveiled Moya, a warm humanoid robot with human-like heated skin, animated facial expressions, pupil tracking, and bipedal walking capability.
Mobile UX
fromFast Company
1 month ago

Could robot phones be the next leap in physical AI?

Smartphone design has become a physical constraint on creativity; future devices must rethink form factors, prioritize creation over consumption, and integrate AI into physical space.
Artificial intelligence
fromFuturism
1 month ago

AI Data Center Security Guards Are Not Human

AI companies are deploying robot security guards, particularly Boston Dynamics' Spot, to patrol massive data centers and reduce labor costs.
Privacy technologies
fromFlowingData
2 months ago

Meta planning facial recognition with glasses

Meta plans to add facial recognition to outward-facing smart-glass cameras that record what users look at, creating significant privacy and trust concerns.
Mobile UX
fromEngadget
1 month ago

Google's Circle to Search can now identify multiple objects in an image

Google's updated Circle to Search now identifies multiple objects simultaneously using Gemini 3, enabling comprehensive shopping searches and complex relationship analysis between image elements.
Python
fromPyImageSearch
1 month ago

SAM 3 for Video: Concept-Aware Segmentation and Object Tracking - PyImageSearch

SAM3 extends beyond static image segmentation to video by maintaining streaming memory and tracking state, enabling unified detection, segmentation, and tracking across frames while preserving object identity over time.
Python
fromPyImageSearch
2 months ago

Grounded SAM 2: From Open-Set Detection to Segmentation and Tracking - PyImageSearch

Grounded SAM 2 extends Grounding DINO by adding pixel-level segmentation and video-aware tracking to convert language-driven detections into precise, persistent object masks.
Python
fromPyImageSearch
2 months ago

Advanced SAM 3: Multi-Modal Prompting and Interactive Segmentation - PyImageSearch

SAM 3 enables flexible multi-modal segmentation using combined text, spatial, and interactive prompts for precise, production-ready workflows.
Artificial intelligence
fromInfoQ
2 months ago

How to Unlock Insights and Enable Discovery Within Petabytes of Autonomous Driving Data

Edge cases in autonomous driving are rare but critical scenarios that must be identified, retrieved, and included to ensure model safety and robustness.
fromwww.scientificamerican.com
1 month ago

How LabOS AI-powered smart goggles could reduce human error in science

Laboratory safety goggles have finally joined the ranks of smart devices. That's the promise behind LabOS, an AI operating system for scientific laboratories built by the Stanford-Princeton AI Coscientist Team, a group led by Stanford University bioengineer Le Cong and Princeton University computer scientist Mengdi Wang, with founding partners that include NVIDIA. Powered by NVIDIA's vision-language models to process visual data, the system is designed to provide AI with real-time knowledge of lab work so it can determine what causes experiments to fail or succeed and rapidly train new scientists to expert levels by guiding them through experimental protocols.
Artificial intelligence
Artificial intelligence
fromHackernoon
2 months ago

Segment Anything in Motion: A Hands-On Guide to sam3-video | HackerNoon

sam3-video is a unified foundation model from Meta Research for prompt-based segmentation that performs segmentation in both images and videos.
Artificial intelligence
fromwww.socialmediatoday.com
1 month ago

Google introduces next iteration of AI image generation model

Google launched Nano Banana 2, a unified AI image generation model combining previous capabilities with advanced world knowledge, real-time web search integration, and enhanced control features for faster, more accurate visual creation.
Artificial intelligence
fromMail Online
1 month ago

Can you tell the difference between real and AI-generated people?

People are overconfident in their ability to distinguish AI-generated faces from real ones and perform only slightly better than chance.
Artificial intelligence
fromeLearning Industry
2 months ago

Artificial Intelligence In Transportation Training And Education

AI enables individualized transportation training by evaluating trainee performance, tailoring instruction, simulating real scenarios, and measuring performance for targeted improvement.
Artificial intelligence
fromInfoWorld
2 months ago

What is context engineering? And why it's the new AI architecture

Context engineering designs and manages the information, tools, and constraints an LLM receives, enabling scalable, high-signal inputs and improved model outcomes.
[ Load more ]