#data-processing

[ follow ]
Intellectual property law
fromPatently-O
1 day ago

Revisiting Sixteen Years of 101 After a Data Correction

Corrected data processing increases post-2019 §101 rejection recovery, raising rejection rates to about 15.5% by mid-2025, while later declines remain smaller.
Artificial intelligence
fromTelecompetitor
1 week ago

Scott Alcott's agentic AI framework for broadband providers: Interview

Agentic AI dynamically gathers evidence, reasons across data types, and triggers actions, enabling broadband providers to automate workflows beyond assistive or generative AI.
Business
from24/7 Wall St.
1 week ago

Visa vs Mastercard. The Numbers Just Told Us Which Network Is Pulling Away

Visa grew net revenue 14.6% with Data Processing as the main driver, while Mastercard grew 15.83% with Value-Added Services accelerating and stablecoin and agentic commerce initiatives advancing.
Data science
fromeLearning Industry
1 month ago

Multimodal AI For Instructional Designers: What It Is, How It Works, And Why It Changes Learning Design

Multimodal AI processes and generates multiple data types, enhancing understanding and output accuracy by mimicking human information processing.
#ai
#apache-spark
fromMedium
11 months ago
Data science

Leveraging Broadcast Joins in Apache Spark (Scala)

Broadcast joins optimize Spark for faster dataset joins by broadcasting smaller datasets, avoiding costly shuffle operations.
fromMedium
11 months ago
Scala

From Frustrating to Fast: Speeding Up Spark Tests Using Shared Sessions

Using a shared Spark session significantly reduces the execution time for unit tests in Spark jobs.
DevOps
fromInfoQ
1 month ago

Pinterest Reduces Spark OOM Failures by 96% Through Auto Memory Retries

Pinterest Engineering reduced out-of-memory failures in Apache Spark workloads by 96% through improved observability, configuration tuning, and automatic memory retries.
Java
fromMedium
2 months ago

Spark Internals: Understanding Tungsten (Part 2)

Catalyst Optimizer and Tungsten work together in Apache Spark to optimize data execution and manage raw binary data.
fromInfoWorld
1 month ago

How Apache Kafka flexed to support queues

Apache Kafka has cemented itself as the de facto platform for event streaming, often referred to as the 'universal data substrate' due to its extensive ecosystem that enables connectivity and processing capabilities.
Scala
Privacy professionals
fromInfoQ
2 months ago

"Pick and Mix" Custom Regions: Cloudflare Introduces Fine-Grained Data Residency Control

Cloudflare's Custom Regions allow customers to define where their data is processed for compliance and control.
DevOps
fromInfoQ
2 months ago

Inside Agoda's Storefront: A Latency-Aware Reverse Proxy for Improving DNS Based Load Distribution

Agoda developed Storefront, an S3-compatible proxy, to enhance load balancing and reliability for large-scale object storage traffic.
DevOps
fromTechzine Global
2 months ago

OpenObserve lowers observability storage costs by 140x

OpenObserve offers an AI-native open source platform that significantly reduces costs and infrastructure needs in the observability market.
Data science
fromInfoWorld
2 months ago

Oracle adds pre-built agents to Private Agent Factory in AI Database 26ai

Structured Data Analysis Agent enhances data processing capabilities for enterprises using tools like Python's pandas library.
Data science
fromTheregister
2 months ago

CERN eggheads burn AI into silicon to stem data deluge

CERN uses custom AI to optimize real-time data collection from the Large Hadron Collider, processing hundreds of terabytes per second.
Digital life
fromInfoWorld
2 months ago

AI optimization: How we cut energy costs in social media recommendation systems

Optimizing data processing in AI can significantly reduce energy consumption and operational costs.
History
fromBig Think
3 months ago

The computing revolution that secretly began in 1776

Computing emerged during the Industrial Revolution as mechanized, systematized calculation to process vast data for astronomy, mapping, trade, and large-scale production.
fromTheregister
6 months ago

Google Workspace AI 'smart features' are on by default

Engineering YouTuber Dave Jones noticed this week that he had been opted into a set of new Workspace smart features without ever being asked. According to Google's help page for the features, the point of the on-by-default settings is to add its Gemini AI across Workspace in order to suck in all your Gmail, Calendar, Chat, Drive, and Meet data so that it can all be cross-referenced.
Privacy technologies
Science
fromwww.nature.com
8 months ago

Publisher Correction: Experimental determination of partial charges with electron diffraction

A missing citation for the XDS software (Kabsch, 2010) was added to the HTML and PDF versions.
fromFast Company
9 months ago

What if the future looks exactly like the past?

When Peter Drucker first met IBM CEO Thomas J. Watson in the 1930s, the legendary management thinker and journalist was somewhat baffled. "He began talking about something called data processing," Drucker recalled, "and it made absolutely no sense to me. I took it back and told my editor, and he said that Watson was a nut, and threw the interview away."
Business
Artificial intelligence
fromFast Company
9 months ago

This startup claims it just outran Nvidia on its own turf

DataPelago's Nucleus dramatically accelerates data processing across hardware, outperforming Nvidia's cuDF and exposing significant software-driven GPU performance limitations.
fromChannelPro
9 months ago

Channel Focus: All you need to know about Snowflake's partner program

The Snowflake Platform enhances enterprises' data management under an AI Data Cloud, focusing on self-managed services, governance, and visibility without requiring extensive end-user hardware.
Artificial intelligence
Gadgets
fromHackernoon
6 years ago

Inside the Bonkers DIY Project to Corral Every Gadget Rumor on Earth | HackerNoon

A system architecture is designed to fetch and analyze tech news, utilizing Kafka, MinIO, and ClickHouse for data processing.
Tech industry
fromComputerWeekly.com
9 months ago

EDGX closes 2.3m funding to boost AI compute for satellites | Computer Weekly

EDGX's Sterna DPU provides AI acceleration for satellites, enabling real-time in-orbit data processing for faster, more efficient services.
fromTechzine Global
9 months ago

Snowflake launches Snowpark Connect to run Spark code natively

Snowpark Connect facilitates Apache Spark code execution directly within Snowflake warehouses, eliminating the need for separate Spark clusters and associated complexities like data movement.
Data science
Python
fromHackernoon
8 years ago

5 Python Libraries I Wish I'd Found Sooner | HackerNoon

Five Python libraries can drastically improve data processing efficiency and debugging experience.
#quantum-computing
fromFortune
10 months ago
Artificial intelligence

Quantum computing is so fire - No, seriously. BofA says it could be humanity's biggest breakthrough since the discovery of fire

fromFortune
10 months ago
Artificial intelligence

Quantum computing is so fire - No, seriously. BofA says it could be humanity's biggest breakthrough since the discovery of fire

#artificial-intelligence
fromLogRocket Blog
10 months ago

Iterator helpers: The most underrated feature in ES2025 - LogRocket Blog

Iterators are built exactly for scenarios where processing data lazily is essential, avoiding loading everything into memory, thus preventing memory overload and application crashes.
JavaScript
fromwww.independent.co.uk
10 months ago

UK's most powerful supercomputer comes online in major AI drive

Britain's most powerful supercomputer, Isambard-AI, has come online to enhance AI research, aiming to develop medical cures and tools to reduce emissions.
UK news
E-Commerce
fromAlleywatch
10 months ago

Heron Data Raises $16.6M to Automate Document-Heavy Workflows with AI

Heron Data uses an AI-powered workflow platform to automate data processing, reducing bottlenecks in business decision-making and improving efficiency in organizations.
fromHackernoon
2 years ago

No-Code Automation With n8n - Start Here | HackerNoon

n8n is a powerful, open-source workflow automation tool that connects different apps, automates repetitive tasks, and streamlines operations without coding.
Online marketing
fromHackernoon
1 year ago

A Developer's Guide to SeaTunnel and Hive Integration with Real-World Configs | HackerNoon

Apache SeaTunnel's high-performance framework enables rapid collection, transformation, and loading of massive datasets, essential for efficient data flow in a big data ecosystem.
Data science
Python
fromMedium
10 months ago

Competition of data processing languages on JVM: Kotlin, Scala and SPL

Kotlin, Scala, and SPL are compared to establish which data processing language offers the highest development efficiency.
fromInfoQ
10 months ago

Inflection Points in Engineering Productivity as Amazon Grew 30x

Black Friday is one of the busiest shopping days for Amazon, alongside Cyber Monday and Prime Day, showcasing the critical retail events for the company.
E-Commerce
Privacy technologies
fromwww.theguardian.com
10 months ago

Palantir accuses UK doctors of choosing ideology over patient interest' in NHS data row

Palantir claims to enhance NHS data processing and patient care despite criticism from British doctors regarding their contract and data handling.
Artificial intelligence
fromMedium
1 year ago

Build Multi-Agentic AI Agents with AWS Bedrock from Scratch..

A multi-agent system is created to facilitate collaboration between different agents, specifically designed for handling user queries.
The agent named 'bedrock-supervisor-agent' is tasked with assessing and directing user questions related to accommodations or restaurants.
fromHackernoon
11 months ago

How to Write Complex Queries in Apache Spark SQL Using CTE (WITH Clause) | HackerNoon

A Common Table Expression (CTE) is a named, temporary result set defined within a single SQL statement, which helps in improving query readability and maintainability.
Data science
Scala
fromInfoQ
11 months ago

Supporting Diverse ML Systems at Netflix

Netflix leverages advanced machine learning infrastructure to optimize content recommendations and operational efficiency across various business use cases.
fromFast Company
11 months ago

The real data revolution hasn't happened yet

The Gartner Hype Cycle illustrates the path of new technologies, depicting public perception from over-expectation to eventual maturity.
Data science
[ Load more ]