#topic-modeling

[ follow ]
#ai
Data science
fromInfoQ
1 week ago

Context Engineering with Adi Polak

Context engineering moves beyond prompt engineering to enhance AI systems by adapting language and practices for better model interaction.
Philosophy
fromJames Bennett
4 days ago

Let's talk about LLMs

The current technological landscape may represent a significant shift driven by large language models, but its ultimate impact remains uncertain.
Data science
fromMedium
4 days ago

The Top 10 LLM Training Datasets for 2026

Large language models require extensive training data, and practitioners can utilize ten leading public datasets for effective training and fine-tuning.
#ai-agents
Data science
fromMedium
1 week ago

15 Datasets for Training and Evaluating AI Agents

Datasets for training and evaluating AI agents are essential for building reliable agentic systems and preventing execution failures.
fromZDNET
2 months ago
Artificial intelligence

Is your AI agent up to the task? 3 ways to determine when to delegate

Data science
fromMedium
1 week ago

15 Datasets for Training and Evaluating AI Agents

Datasets for training and evaluating AI agents are essential for building reliable agentic systems and preventing execution failures.
fromZDNET
2 months ago
Artificial intelligence

Is your AI agent up to the task? 3 ways to determine when to delegate

Software development
fromInfoWorld
1 week ago

Meta shows structured prompts can make LLMs more reliable for code review

Code review is evolving towards machine-led verification, improving accuracy but introducing tradeoffs like increased latency and workflow overhead.
#ai-detection
Marketing tech
fromWashington City Paper
2 weeks ago

Top 6 AI Detector Tools for Editors, Educators, and Content Teams

AI detection is essential for maintaining content integrity as patterns of AI-generated content become more prevalent and indistinguishable from human writing.
Marketing tech
fromWashington City Paper
2 weeks ago

Top 6 AI Detector Tools for Editors, Educators, and Content Teams

AI detection is essential for maintaining content integrity as patterns of AI-generated content become more prevalent and indistinguishable from human writing.
Python
fromPyImageSearch
2 weeks ago

Autoregressive Model Limits and Multi-Token Prediction in DeepSeek-V3 - PyImageSearch

Multi-Token Prediction (MTP) in DeepSeek-V3 allows simultaneous token forecasting, enhancing training speed and contextual understanding.
Scala
fromMedium
2 weeks ago

Data Extraction and Classification Using Structural Pattern Matching in Scala

Scala pattern matching enhances code readability and extensibility in real-world data engineering use cases.
#structured-data
Data science
fromAol
1 week ago

Demystifying structured data: How to speak an LLM's native language

Structured data is essential for LLMs to accurately interpret and rank online content, enhancing search visibility and user engagement.
Data science
fromAol
1 week ago

Demystifying structured data: How to speak an LLM's native language

Structured data is essential for LLMs to accurately interpret and rank online content, enhancing search visibility and user engagement.
Data science
fromAol
1 week ago

Demystifying structured data: How to speak an LLM's native language

Structured data is essential for LLMs to accurately interpret and rank online content, enhancing search visibility and user engagement.
Data science
fromAol
1 week ago

Demystifying structured data: How to speak an LLM's native language

Structured data is essential for LLMs to accurately interpret and rank online content, enhancing search visibility and user engagement.
#ollama
Science
fromNature
3 weeks ago

Drowning in data sets? Here's how to cut them down to size

The Square Kilometre Array Observatory will generate massive data, but storage and retention pose significant challenges for researchers.
Science
fromThe Cipher Brief
3 weeks ago

Why the U.S. Must Build the Ultimate Multi-Modal Foundation Model

Advanced AI models like AlphaEarth demonstrate pixel-level geospatial intelligence capabilities that must be integrated into U.S. national security frameworks to maintain technological leadership.
Business intelligence
fromTechzine Global
1 month ago

ThoughtSpot launches Spotter Semantics for AI agents

ThoughtSpot introduces Spotter Semantics, a semantic layer using deterministic search tokens to deliver consistent, trustworthy insights to AI agents and business users without relying on LLM-powered text-to-SQL.
Software development
fromMedium
3 weeks ago

Inside Dify AI: How RAG, Agents, and LLMOps Work Together in Production

Dify AI provides a unified platform for deploying production language model systems with built-in solutions for data freshness, observability, versioning, and safe deployment across multiple cloud environments.
Artificial intelligence
fromwww.scientificamerican.com
4 weeks ago

As AI keeps improving, mathematicians struggle to foretell their own future

First Proof, a benchmarking initiative, is launching its second round to evaluate large language models' ability to contribute to research-level mathematics, now requiring transparency and access from participating AI companies.
Artificial intelligence
fromMail Online
1 month ago

Can you tell which of these was written by ChatGPT?

Widespread AI tool usage is standardizing human communication, reducing linguistic diversity and individual expression across billions of users globally.
Data science
fromInfoQ
1 month ago

Google Researchers Propose Bayesian Teaching Method for Large Language Models

Google researchers developed a training method enabling large language models to approximate Bayesian reasoning by learning from optimal Bayesian system predictions, improving belief updates during multi-step interactions.
#google-ai-mode
Medicine
fromHarvard Gazette
2 months ago

New AI tool predicts brain age, dementia risk, cancer survival - Harvard Gazette

BrainIAC, a brain imaging adaptive core, accurately extracts multiple disease risk signals from routine brain MRIs using self-supervised learning and limited training data.
fromPsychology Today
1 month ago

How AI Is Rewiring Winemaking and Wine Collecting

The Brain Science Here's where neuropsychology enters the vineyard. The human brain's relationship with wine is deeply emotional and multisensory. When we taste wine, our orbitofrontal cortex integrates sensory information with memory and emotion; it's why a particular bottle might remind us of our grandmother's kitchen or that study-abroad summer in Tuscany. This neural complexity is what makes wine special, and it's also what makes AI's role in the industry controversial.
Wine
Python
fromPyImageSearch
2 months ago

TF-IDF vs. Embeddings: From Keywords to Semantic Search - PyImageSearch

Vector databases and embeddings enable semantic search and retrieval-augmented generation by mapping text meaning into geometric vectors for similarity-based retrieval.
Information security
fromSecuritymagazine
2 months ago

Product Spotlight on Analytics

Taelor Sutherland is Associate Editor at Security magazine covering enterprise security, coordinating digital content, and holding a BA in English Literature from Agnes Scott College.
fromPractical Ecommerce
1 month ago

AI Turns Weather Data into Sales

Weather impacts sales. Every retailer knows it. But for most, the likelihood that it might rain, snow, or sleet on the third of March somewhere in the Midwest is rarely used. Vendors such as Weather Trends have offered accurate, long-range forecasts for more than 20 years. But the opportunity is not predicting the weather; it's knowing what to do with the data. AI might change that.
E-Commerce
Marketing tech
fromExchangewire
2 months ago

Ads and LLMs: Gearing Up for a New Era of Advertising

LLM-integrated ads will deliver contextual, in-conversation, intent-driven promotions, forcing advertisers to adapt creative strategies while accepting reduced control over ad placement.
Python
fromPyImageSearch
1 month ago

Vector Search Using Ollama for Retrieval-Augmented Generation (RAG) - PyImageSearch

Retrieval-Augmented Generation (RAG) augments LLMs with retrieved context from vector search (FAISS) to produce accurate, up-to-date, evidence-grounded responses.
Science
fromFlowingData
2 months ago

Your interpretation of uncertainty language compared

Verbal probability expressions can be mapped to percentage values between 0% (impossible) and 100% (definite) to quantify uncertainty.
Artificial intelligence
fromTheregister
1 month ago

AI models get better at math but still get low marks

Current LLMs struggle with mathematical accuracy, with even top performers scoring C-grade equivalent on practical math benchmarks, though recent versions show modest improvements.
fromFortune
1 month ago

We studied chatbots and language and saw a huge problem: They mean 80% when they say 'likely' but humans hear 65% | Fortune

By comparing how AI models and humans map these words to numerical percentages, we uncovered significant gaps between humans and large language models. While the models do tend to agree with humans on extremes like 'impossible,' they diverge sharply on hedge words like 'maybe.' For example, a model might use the word 'likely' to represent an 80% probability, while a human reader assumes it means closer to 65%.
Artificial intelligence
fromInfoQ
1 month ago

Building Embedding Models for Large-Scale Real-World Applications

What happens under the hood? How is the search engine able to take that simple query, look for images in the billions, trillions of images that are available online? How is it able to find this one or similar photos from all that? Usually, there is an embedding model that is doing this work behind the hood.
Artificial intelligence
fromMedium
2 months ago

From Graphs to Generative AI: Building Context That Pays-Part 1

Every year, poor communication and siloed data bleed companies of productivity and profit. Research shows U.S. businesses lose up to $1.2 trillion annually to ineffective communication, that's about $12,506 per employee per year. This stems from breakdowns that waste an average of 7.47 hours per employee each week on miscommunications. The damage isn't only interpersonal; it's structural. Disconnected and fragmented data systems mean that employees spend around 12 hours per week just searching for information trapped in those silos.
Data science
Data science
fromCIO
2 months ago

5 perspectives on modern data analytics

Data/business analytics is the top IT investment priority, yet analytics projects often fail due to poor data, vague objectives, and one-size-fits-all solutions.
fromNature
2 months ago

Multimodal learning with next-token prediction for large multimodal models - Nature

Since AlexNet5, deep learning has replaced heuristic hand-crafted features by unifying feature learning with deep neural networks. Later, Transformers6 and GPT-3 (ref. 1) further advanced sequence learning at scale, unifying structured tasks such as natural language processing. However, multimodal learning, spanning modalities such as images, video and text, has remained fragmented, relying on separate diffusion-based generation or compositional vision-language pipelines with many hand-crafted designs.
Artificial intelligence
Data science
fromNature
2 months ago

How to stop the survey-taking AI chatbots that threaten to upend social science

Online survey recruitment faces widespread inauthentic and automated responses, increasingly amplified by AI agents, threatening data validity.
Artificial intelligence
fromInfoQ
2 months ago

MIT's Recursive Language Models Improve Performance on Long-Context Tasks

Recursive Language Models enable LLMs to handle inputs up to 100x longer by using a programming environment and recursive code to decompose and preprocess prompts.
Artificial intelligence
fromNature
2 months ago

Training large language models on narrow tasks can lead to broad misalignment - Nature

Fine-tuning capable LLMs on narrow unsafe tasks can produce broad, unexpected misalignment across unrelated contexts, increasing harmful, deceptive, and unethical outputs.
Artificial intelligence
fromInfoQ
2 months ago

Foundation Models for Ranking: Challenges, Successes, and Lessons Learned

Large-scale search and recommendation systems use two-stage retrieval and ranking pipelines to efficiently serve personalized results for hundreds of millions of users and items.
Artificial intelligence
fromNature
2 months ago

AI chatbots are infiltrating social-science surveys - and getting better at avoiding detection

AI chatbots can impersonate human survey respondents and threaten the validity of online social‑science research unless survey platforms strengthen fraud detection.
Artificial intelligence
fromInfoQ
2 months ago

Building LLMs in Resource-Constrained Environments: A Hands-On Perspective

Prioritize small, resource-efficient models and iterative, human-in-the-loop data creation to build practical, improvable AI under infrastructure and data constraints.
Artificial intelligence
fromWIRED
2 months ago

The Math on AI Agents Doesn't Add Up

Transformer-based LLMs have fundamental computational limitations that prevent them from reliably performing complex agentic tasks, making full automation unlikely.
Artificial intelligence
fromInfoWorld
2 months ago

What is context engineering? And why it's the new AI architecture

Context engineering designs and manages the information, tools, and constraints an LLM receives, enabling scalable, high-signal inputs and improved model outcomes.
fromFast Company
2 months ago

Are LTMs the next LLMs? This new type of AI can do what large-language models can't

A major difference between LLMs and LTMs is the type of data they're able to synthesize and use. LLMs use unstructured data-think text, social media posts, emails, etc. LTMs, on the other hand, can extract information or insights from structured data, which could be contained in tables, for instance. Since many enterprises rely on structured data, often contained in spreadsheets, to run their operations, LTMs could have an immediate use case for many organizations.
Artificial intelligence
fromComputerworld
2 months ago

OpenAI's GPT is getting better at mathematics

OpenAI's GPT-5.2 Pro does better at solving sophisticated math problems than older versions of the company's top large language model, according to a new study by Epoch AI, a non-profit research institute.
Artificial intelligence
fromTheregister
2 months ago

OpenAI will try to guess your age before ChatGPT gets spicy

sensitive or potentially harmful content.
Artificial intelligence
Artificial intelligence
fromComputerworld
1 month ago

Researchers propose a self-distillation fix for 'catastrophic forgetting' in LLMs

Continual learning is essential for foundation models; SDFT uses in-context learning to generate on-policy signals, avoiding explicit reward functions and reducing forgetting.
Artificial intelligence
fromInfoQ
2 months ago

Autonomous Big Data Optimization: Multi-Agent Reinforcement Learning to Achieve Self-Tuning Apache Spark

A Q-learning agent autonomously learns and generalizes optimal Spark configurations by discretizing dataset features and combining with Adaptive Query Execution for superior performance.
[ Load more ]