Practical Tools

apify-google-news-scraper

Pay per result Β· No subscription

Google News Scraper

Google News Scraper is a robust, enterprise-grade Apify Actor designed to extract high-quality news metadata from Google News. It uses a custom interaction engine that bypasses limitations of dynamic classes, ensuring long-term stability and high reliability.

Features

  • πŸ“° Query-based Search: Extract articles for any keyword or complex search query.
  • 🎯 Dynamic Topic Selection: Scrape specific news sections like World, Business, Technology, Entertainment, Sports, Science, and Health.
  • 🌍 Localization Support: Target news from specific countries and in specific languages using Google's hl, gl, and ceid parameters.
  • πŸ•’ Timeframe Filtering: Filter results by the last hour, day, week, month, or year.
  • πŸ›‘οΈ Proxy Integration: Seamlessly works with Apify Proxy to avoid blocks and geofencing.
  • πŸš€ Performance Optimized: Replaced heavy computation (like redirection following or full text extraction) with an ultra-fast metadata engine.

What data does Google News Scraper collect?

The scraper extracts the following fields for each article:

Field Description
title The headline of the article.
link The direct Google News redirect link to the article.
image URL of the featured image (fallback to placeholder if missing).
source Name of the news organization (e.g., The New York Times).
sourceIcon URL of the publisher's favicon/logo.
author The author's name (extracted from the article card).
datetime ISO timestamp of publication.
time Human-readable publication time (e.g., "5 hours ago").

How to use Google News Scraper

Query-based Search

Simply enter your search query in the searchTerm field. You can use advanced search operators just like on Google:

  • "Artificial Intelligence" for exact matches.
  • site:reuters.com to limit results to a specific domain.
  • intitle:Nvidia to find keywords in headlines.

Topic-based Search

Instead of a search term, you can select a News Section from the dropdown. The scraper will navigate to the home page and dynamically click the corresponding tab (World, Business, etc.) to ensure it gets the latest curated topics.

Localization

Set the language (e.g., en) and country (e.g., US) to get localized results. This adjusts the Google News region settings automatically.

Input Parameters

Field Type Default Description
searchTerm string β€” The keyword to search for.
section enum Default Predefined Google News topic to scrape.
maxArticles integer 99 Maximum number of results to return.
timeframe string 7d Filter by time (1h, 1d, 7d, 1m, 1y).
language string en Language code for results.
country string US Country code for results.

Example Output

{
  "title": "Donald Trump says Iran war will end β€˜very soon’",
  "link": "https://news.google.com/read/CBMicEFVX3lxTE1ZZTc0Sm9BSXpmc3BkeC...",
  "image": "https://news.google.com/api/attachments/CC8iK0N...",
  "source": "Financial Times",
  "sourceIcon": "https://encrypted-tbn2.gstatic.com/faviconV2?url=https://www.ft.com...",
  "author": "By James Politi, Lauren Fedor & Jamie Smyth",
  "datetime": "2026-03-10T06:07:43Z",
  "time": "6 hours ago",
  "articleType": "regular"
}

Why Choose This Scraper?

Unlike other scrapers that rely on fragile CSS classes, this Actor targets stable functional attributes (jscontroller, data-n-tid) used by Google. It also handles cookie consent dialogs automatically and uses Playwright for a "human-like" interaction that is significantly less likely to be blocked.