#ai-performance

fromZDNET
1 week ago

5 ways to be a great AI agent manager, according to business leaders

Antony Hausdoerfer emphasized that successful AI managers must ensure AI agents deliver trusted value and safe operations, focusing on applications that yield meaningful outcomes.
Business
#language-models
Artificial intelligence
fromArs Technica
4 months ago

New AI text diffusion models break speed barriers by pulling words from noise

Diffusion models offer comparable performance to traditional models but with dramatically improved speed, changing dynamics in AI applications.
Artificial intelligence
fromInfoWorld
2 months ago

Learning how to measure genAI's impact

AI model improvements are often difficult to quantify accurately.
Smaller language models may outperform larger ones in practical applications.
The debate on AGI misdefines human intelligence benchmarks.
fromHackernoon
1 year ago
Artificial intelligence

phi-3-mini's Triumph: Redefining Performance on Academic LLM Benchmarks

#atari-2600
Artificial intelligence
fromGadgets 360
1 month ago

Should You Buy an AI PC? In Conversation With Asus' Arnold Su

Asus' ROG laptops with RTX 50 series GPUs emphasize gaming and AI performance and have drawn significant consumer interest and pre-orders in India.
fromIT Pro
2 months ago

Acer's new Swift Edge 14 AI is a Copilot+ MacBook Air killer

The Swift Edge 14 AI Copilot+ PC is one of the lightest devices in its category, weighing only 0.99kg, making it a highly portable laptop choice.
Apple
#qualcomm
#openai
fromTechCrunch
2 months ago
Artificial intelligence

Improvements in 'reasoning' AI models may slow down soon, analysis finds

Artificial intelligence
fromZDNET
3 months ago

OpenAI's Deep Research has more fact-finding stamina than you, but it's still wrong half the time

OpenAI's Deep Research technology surpasses other models and humans in web searches, but still fails nearly half the time.
fromTechCrunch
2 months ago
Artificial intelligence

OpenAI's o3 AI model scores lower on a benchmark than the company initially implied

fromTechCrunch
3 months ago

Meta's vanilla Maverick AI model ranks below rivals on a popular chat benchmark

The incident prompted the maintainers of LM Arena to apologize, change their policies, and score the unmodified, vanilla Maverick.
Artificial intelligence
#nvidia
Artificial intelligence
fromHackernoon
3 months ago

Nvidia Promises 40x Hopper Performance in Blackwell Unveil at GTC 2025

NVIDIA's Blackwell platform offers groundbreaking AI performance improvements.
Baidu's ERNIE 4.5 outperforms GPT-4o at a fraction of the cost.
Data science
fromComputerworld
4 months ago

Chat with your data: How 4 genAI tools stack up

AI tools vary in effectiveness for retrieving specific information from social media and structured data sources.
Claude and NotebookLM performed better in targeted searches than ChatGPT and Perplexity.
Challenges of navigating extensive datasets highlight real-world applications in demographic research.