#cloud-service-reliability

#aws
DevOps
fromAmazon Web Services
1 hour ago

Troubleshooting environment with AI analysis in AWS Elastic Beanstalk | Amazon Web Services

AWS Elastic Beanstalk, which simplifies web application deployment and scaling, now offers AI Analysis for troubleshooting environment health issues.
DevOps
fromInfoQ
20 hours ago

AWS Launches Sustainability Console with API Access and Scope 1-3 Emissions Reporting

AWS launched a Sustainability console for consolidated carbon emissions reporting with enhanced access and API features.
DevOps
fromTheregister
4 days ago

AWS put a file system on S3; I stress-tested it

AWS S3 Files allows mounting S3 buckets as NFS shares, providing solid conflict resolution and cost-effective storage options.
DevOps
fromTechzine Global
3 days ago

AWS launches Agent Registry for managing AI agents

AWS introduces the Agent Registry to centralize AI agent management and reduce chaos in organizations deploying numerous agents.
DevOps
fromTheregister
4 days ago

AWS: Agents shouldn't be secret, so we built a registry

AWS Agent Registry enhances visibility and control over AI agents in corporate environments.
DevOps
fromInfoWorld
3 days ago

AWS targets AI agent sprawl with new Bedrock Agent Registry

AWS introduces Agent Registry to help enterprises manage and govern AI agents effectively.
Data science
fromInfoWorld
3 hours ago

Google Cloud introduces QueryData to help AI agents create reliable database queries

QueryData enhances AI agents' accuracy in querying databases by translating natural language into precise database queries.
fromTechzine Global
6 hours ago

Commvault launches AI tools for secure agentic AI era

"In agentic environments, agents mutate state across data, systems, and configurations in ways that compound fast and are hard to trace," says Pranay Ahlawat, Chief Technology and AI Officer at Commvault.
Information security
#datacentres
Environment
fromComputerWeekly.com
9 hours ago

Go West! US datacentres head for available and cheap energy | Computer Weekly

Datacentre construction in the US is shifting towards central regions, particularly Texas, due to power availability and water use mitigation technologies.
London startup
fromComputerWeekly.com
6 hours ago

Datacentre developers tout benefits to local communities, but do they deliver? | Computer Weekly

Datacentre developments are causing challenges for local businesses, raising concerns about energy consumption and community impact despite potential local benefits.
Careers
fromwww.businessinsider.com
6 hours ago

The future may be 'frustrating' for engineers seeking a 'pure software development career,' AWS VP says

Junior software engineers may increasingly engage with customers rather than solely focusing on coding in isolation.
#microsoft
World news
fromTheregister
5 days ago

Microsoft hints at bit bunkers for war zones

Microsoft is redesigning datacenters in conflict-prone regions due to Iranian attacks targeting Middle Eastern facilities linked to US military operations.
Tech industry
fromTheregister
3 days ago

Microsoft cuts cloudy desktop prices by 20 percent

Starting May 1st, Microsoft is reducing Windows 365 cloud PC prices by 20% to make them more cost-effective for small and medium businesses.
fromInfoQ
3 days ago

Latency: The Race to Zero...Are We There Yet?

In the fintech industry we can link latency directly to profit and money. If I have lower latency than the competition, I can get to the better deals, I can make the better deals.
Venture
#amazon
Tech industry
fromTheregister
3 days ago

AWS ponders selling its home-grown chips by the rack-load

Amazon's chip business could generate ~$50 billion annually if sold independently, highlighting significant demand and growth potential.
DevOps
fromwww.businessinsider.com
3 days ago

Amazon creates 'Project Houdini' to make data center delays disappear

Amazon's Project Houdini aims to speed up data center construction by moving processes to factories, addressing AI demand and capacity constraints.
#cloud-computing
Higher education
fromInfoWorld
3 days ago

Cloud degrees are moving online

Accredited online cloud computing degrees are expanding, reducing costs and providing practical value for students and employers.
DevOps
fromInfoWorld
2 weeks ago

Edge clouds and local data centers reshape IT

Cloud computing is evolving towards a selectively distributed model to address latency, sovereignty, and resilience in smart cities and AI applications.
DevOps
fromTechzine Global
5 hours ago

Cloudflare introduces new features for building and deploying agents

Cloudflare is transforming AI development with Dynamic Workers, Sandboxes, and Artifacts for secure, scalable, and efficient code execution.
#ai
Software development
fromDevOps.com
6 days ago

If it Isn't Code, it's Just Advice - DevOps.com

AI coding agents struggle with third-party systems and dashboard configurations, limiting their effectiveness in automation and verification.
DevOps
fromDevOps.com
3 days ago

CloudBees Delivers on AI Promise to Improve Application Testing - DevOps.com

CloudBees Smart Tests uses AI to prioritize tests, reducing CI/CD processing time significantly.
Information security
from24/7 Wall St.
3 days ago

The "SaaS-Pocalypse" Continues: Cloudflare, ServiceNow, CrowdStrike Under Fire as Anthropic Rewrites the Rules

The release of Anthropic's AI security product has significantly impacted investor confidence in enterprise software companies, leading to sharp stock declines.
Tech industry
fromComputerWeekly.com
5 days ago

Azure customers up in arms over 'full' UK South region | Computer Weekly

Microsoft Azure is facing capacity issues in the UK South region, affecting virtual machine availability and customer migrations.
Artificial intelligence
from24/7 Wall St.
4 days ago

The Real Reason Cloudflare Is Down 11% Today Has Nothing to Do With Insider Selling

Insider selling at Cloudflare is routine and does not indicate trouble; the real concern is competition from Anthropic's new AI offerings.
London startup
fromEngadget
4 days ago

OpenAI 'pauses' its Stargate UK data center plan

OpenAI is pausing the Stargate UK project due to high energy costs and regulatory issues, despite recognizing the potential for AI in the UK.
#devops
DevOps
fromInfoWorld
6 days ago

What enterprise devops teams should learn from SaaS

Enterprise devops teams can enhance resiliency by adopting practices from SaaS providers, focusing on robust testing, monitoring, and seamless upgrades.
DevOps
fromDevOps.com
15 hours ago

Ten Great DevOps Job Opportunities - DevOps.com

DevOps.com is launching a weekly jobs report to highlight opportunities for DevOps professionals.
DevOps
fromMedium
1 day ago

Kubernetes Is Not DevOps: A Short Story

Understanding systems behind tools is crucial for effective DevOps engineering.
DevOps
fromMedium
1 day ago

Set it up once, test it properly, and let the system handle the rest.

Automating SSL certificate renewal prevents production outages and reduces stress during incidents.
#nutanix
Tech industry
fromTheregister
5 days ago

Nutanix thinks some Azure cloud desktops belong on-prem

Nutanix partners with Microsoft to enhance on-prem desktop virtualization, addressing challenges of VDI and promoting hybrid operations for Azure Virtual Desktop.
DevOps
fromTechzine Global
4 days ago

Nutanix won't give AI free rein: infrastructure remains a human endeavor

Nutanix focuses on facilitating AI workloads while maintaining human oversight in IT management, emphasizing minimal changes for infrastructure administrators.
DevOps
fromTechzine Global
6 days ago

As IT complexity escalates, Nutanix fights back

Nutanix is prioritizing flexibility and aims to be a leading agentic AI platform amidst external IT developments.
fromSilicon Canals
6 days ago

When militaries share data centers with banks: how Gulf strikes exposed a structural flaw in global cloud infrastructure - Silicon Canals

When civilian banks, logistics platforms, and payment processors share physical data center infrastructure with military AI systems, those facilities become legitimate military targets under international humanitarian law - and the civilian services housed inside lose their legal protection.
Information security
DevOps
fromInfoQ
3 days ago

Google Cloud Highlights Ongoing Work on PostgreSQL Core Capabilities

Google Cloud has made significant technical contributions to PostgreSQL, enhancing logical replication, upgrade processes, and system stability.
DevOps
fromInfoWorld
5 days ago

AWS turns its S3 storage service into a file system for AI agents

S3 Files simplifies access to Amazon S3, enhancing its role as a primary data layer for AI and modern applications.
#kubernetes
DevOps
fromInfoWorld
4 days ago

Bringing databases and Kubernetes together

Automating Kubernetes workloads with Operators can provide DBaaS functionality while avoiding provider lock-in.
DevOps
fromMedium
1 week ago

Understanding Kubernetes Architecture is a MUST

Understanding Kubernetes architecture is essential for effective cloud-native deployment and troubleshooting.
#multicloud-strategy
fromTechzine Global
5 days ago

AWS S3 buckets now support file systems

S3 Files is built on Amazon EFS and automatically translates file system operations into S3 requests, allowing applications to work with S3 data without code changes.
DevOps
#cloud-monitoring
fromNew Relic
1 week ago
DevOps

Cloud Monitoring Best Practices For Reliable, Unified Observability

Effective cloud monitoring focuses on unifying telemetry and providing context for engineers to make informed decisions.
DevOps
fromNew Relic
2 weeks ago

Cloud Monitoring Tools: 5 Best Platforms to Evaluate in 2026

Effective cloud monitoring focuses on real-time telemetry correlation to understand failures, not just data collection.
Tech industry
fromTechzine Global
1 month ago

Amazon calls engineers together after AI-related outages

Amazon requires junior and mid-level engineers to obtain senior approval before deploying AI-assisted code changes following multiple outages linked to AI coding tools.
#observability
DevOps
fromDevOps.com
6 days ago

Survey Surfaces Rising Tide of Investments in Observability - DevOps.com

A significant number of enterprise IT leaders plan to invest heavily in observability to enhance application performance and reliability.
fromNew Relic
1 week ago
DevOps

What is observability? How observability can help you achieve your business goals.

Conventional monitoring fails to address unknown unknowns, while observability provides insights into complex systems and enhances incident response.
Software development
fromInfoWorld
1 month ago

The reliability cost of default timeouts

Unbounded waiting in distributed systems causes slowness to manifest as outages before traditional failure detection triggers, draining capacity and degrading user experience.
US politics
fromFortune
2 months ago

Inside the race to build data centers | Fortune

Mega-scale AI data centers are driving AI growth, transforming landscapes, straining energy and water resources, and creating major political and economic conflicts.
DevOps
fromInfoWorld
6 days ago

The Terraform scaling problem: When infrastructure-as-code becomes infrastructure-as-complexity

Terraform scales well for small teams but faces significant challenges as organizations grow, leading to complexity and management issues.
#network-monitoring
DevOps
fromNew Relic
1 week ago

6 Network Monitoring Best Practices For Clarity in Distributed Systems

Effective network monitoring prioritizes understanding impact and taking action quickly over merely collecting metrics.
DevOps
fromNew Relic
1 week ago

Exploring application performance monitoring (APM)

Application performance monitoring (APM) is essential for businesses to ensure optimal user experiences and maintain application performance in a complex digital landscape.
DevOps
fromMedium
1 week ago

Fair Multitenancy - Beyond Simple Rate Limiting

Fair multitenancy ensures equitable infrastructure access for customers, balancing simplicity, performance, and safety in shared environments.
DevOps
fromInfoQ
1 week ago

Replacing Database Sequences at Scale Without Breaking 100+ Services

Validating requirements can simplify complex problems, and embedding sequence generation reduces network calls, enhancing performance and reliability.
Software development
fromLoopwerk
2 months ago

It's time to leave Heroku

Heroku went from a beloved free, frictionless host for hobby projects to a paid, unstable platform marked by security breaches, a removed free tier, and outages.
fromInfoWorld
2 months ago

The private cloud returns, for AI workloads

A North American manufacturer spent most of 2024 and early 2025 doing what many innovative enterprises did: aggressively standardizing on the public cloud by using data lakes, analytics, CI/CD, and even a good chunk of ERP integration. The board liked the narrative because it sounded like simplification, and simplification sounded like savings. Then generative AI arrived, not as a lab toy but as a mandate. "Put copilots everywhere," leadership said. "Start with maintenance, then procurement, then the call center, then engineering change orders."
Artificial intelligence
DevOps
fromAmazon Web Services
1 week ago

Securely connect AWS DevOps Agent to private services in your VPCs | Amazon Web Services

AWS DevOps Agent enhances operational efficiency by securely connecting to private resources in VPCs, optimizing performance and incident management.
fromDevOps.com
1 month ago

What to do About AI's Forced Rethink of Reliability in Modern DevOps - DevOps.com

For years, reliability discussions have focused on uptime and whether a service met its internal SLO. However, as systems become more distributed, reliant on complex internet stacks, and integrated with AI, this binary perspective is no longer sufficient. Reliability now encompasses digital experience, speed, and business impact. For the second year in a row, The SRE Report highlights this shift.
Software development
DevOps
fromInfoWorld
1 week ago

Azure's new AI modernization tools

Microsoft's Azure Copilot aids in application migration and modernization, addressing technical debt and improving cloud infrastructure management.
Artificial intelligence
fromInfoWorld
1 month ago

Five MCP servers to rule the cloud

Major cloud providers now offer official MCP servers that let AI agents automate cloud operations using existing cloud credentials and natural language commands.
Information security
fromThe Hacker News
2 months ago

When Cloud Outages Ripple Across the Internet

Cloud infrastructure outages can disable identity authentication and authorization, creating hidden single points of failure that cause broad operational and security impacts.
Software development
fromInfoWorld
1 month ago

Cloud Cloning: A new approach to infrastructure portability

Cloud Cloning captures complete cloud infrastructure snapshots and maps them onto target cloud services and configurations to enable accurate cloud portability.
#azure-outage
DevOps
fromInfoWorld
2 weeks ago

Rethinking VM data protection in cloud-native environments

KubeVirt enables Kubernetes to manage both VMs and containers, requiring new strategies for VM lifecycle management and data protection.
Information security
fromThe Hacker News
2 months ago

DevOps & SaaS Downtime: The High (and Hidden) Costs for Cloud-First Businesses

Relying solely on public cloud and DevOps SaaS platforms increases operational risk as outages, attacks, and Shared Responsibility gaps drive rising downtime and service degradation.
Software development
fromInfoWorld
2 months ago

Why cloud migration needs a new approach

Existing cloud-native migration tools, infrastructure-as-code, and governance solutions fail to provide true infrastructure portability, causing multicloud fragmentation and migration friction.
Tech industry
fromInfoWorld
2 months ago

The next 10 years for cloud computing

Enterprises are abandoning unquestioning public cloud adoption due to high costs, limited productivity gains, and vendor lock-in, prompting providers to change strategies.
Information security
fromTheregister
2 months ago

AI framework flaws put enterprise clouds at risk of takeover

Two Chainlit vulnerabilities enable arbitrary file reads and SSRF attacks, risking exposure of environment variables, credentials, and potential cloud takeover if not patched.
Artificial intelligence
fromEngadget
1 month ago

13-hour AWS outage reportedly caused by Amazon's own AI tools

An agentic Kiro AI action to delete and recreate an environment triggered a 13-hour AWS outage, enabled by a staffer’s broader-than-expected permissions.
Tech industry
fromTheregister
2 months ago

Microsoft's shift to cloud management sw brings concerns

Microsoft will deprecate SCOM management packs for SQL Server Reporting Services, Power BI Report Server and Analysis Services; support and updates end January 2027.
DevOps
fromInfoWorld
3 weeks ago

Cloud at 20: Cost, complexity, and control

Cloud computing has failed to deliver on its promise of simplified IT operations and cost savings, instead creating greater complexity and spiraling expenses for most enterprises.
DevOps
fromInfoQ
3 weeks ago

Configuration as a Control Plane: Designing for Safety and Reliability at Scale

Configuration in cloud-native systems is a dynamic control plane that directly influences system behavior and reliability at runtime.
DevOps
fromMedium
3 weeks ago

The Hidden Cost Centers in Kubernetes No One Tracks - Until the Cloud Bill Explodes

Kubernetes clusters incur hidden costs through idle workloads, oversized resource requests, and poor scheduling practices that drain budgets without delivering proportional value.
fromDevOps.com
1 month ago

Zero Downtime Multicloud Migrations for Observability Control Planes - DevOps.com

An observability control plane isn't just a dashboard. It's the operational authority system. It defines alert rules, routing, ownership, escalation policy, and notification endpoints. When that layer is wrong, the impact is immediate. The wrong team gets paged. The right team never hears about the incident. Your service level indicators look clean while production burns.
DevOps
fromDevOps.com
1 month ago

Harness Readies Resilience Testing Platform to Make Applications More Robust - DevOps.com

The Harness Resilience Testing platform extends the scope of the tests provided to include application load and disaster recovery (DR) testing tools that will enable DevOps teams to further streamline workflows.
DevOps
fromDbmaestro
5 years ago

Database Delivery Automation in the Multi-Cloud World

The main advantage of going the Multi-Cloud way is that organizations can "put their eggs in different baskets" and be more versatile in their approach to how they do things. For example, they can mix it up and opt for a cloud-based Platform-as-a-Service (PaaS) solution when it comes to the database, while going the Software-as-a-Service (SaaS) route for their application endeavors.
DevOps
DevOps
fromTechzine Global
2 months ago

What Microsoft Azure Local can and cannot do

Azure Local delivers Azure cloud functionality on-premises, using Hyper-V/Stack HCI, validated server hardware, and Azure Portal management for gradual hybrid migration.