Google News Scraper

Google News Scraper is a robust, enterprise-grade Apify Actor designed to extract high-quality news metadata from Google News. It uses a custom interaction engine that bypasses limitations of dynamic classes, ensuring long-term stability and high reliability.

Features

📰 Query-based Search: Extract articles for any keyword or complex search query.
🎯 Dynamic Topic Selection: Scrape specific news sections like World, Business, Technology, Entertainment, Sports, Science, and Health.
🌍 Localization Support: Target news from specific countries and in specific languages using Google's hl, gl, and ceid parameters.
🕒 Timeframe Filtering: Filter results by the last hour, day, week, month, or year.
🛡️ Proxy Integration: Seamlessly works with Apify Proxy to avoid blocks and geofencing.
🚀 Performance Optimized: Replaced heavy computation (like redirection following or full text extraction) with an ultra-fast metadata engine.

What data does Google News Scraper collect?

The scraper extracts the following fields for each article:

Field	Description
`title`	The headline of the article.
`link`	The direct Google News redirect link to the article.
`image`	URL of the featured image (fallback to placeholder if missing).
`source`	Name of the news organization (e.g., The New York Times).
`sourceIcon`	URL of the publisher's favicon/logo.
`author`	The author's name (extracted from the article card).
`datetime`	ISO timestamp of publication.
`time`	Human-readable publication time (e.g., "5 hours ago").

How to use Google News Scraper

Query-based Search

Simply enter your search query in the searchTerm field. You can use advanced search operators just like on Google:

"Artificial Intelligence" for exact matches.
site:reuters.com to limit results to a specific domain.
intitle:Nvidia to find keywords in headlines.

Topic-based Search

Instead of a search term, you can select a News Section from the dropdown. The scraper will navigate to the home page and dynamically click the corresponding tab (World, Business, etc.) to ensure it gets the latest curated topics.

Localization

Set the language (e.g., en) and country (e.g., US) to get localized results. This adjusts the Google News region settings automatically.

Input Parameters

Field	Type	Default	Description
`searchTerm`	string	—	The keyword to search for.
`section`	enum	`Default`	Predefined Google News topic to scrape.
`maxArticles`	integer	99	Maximum number of results to return.
`timeframe`	string	`7d`	Filter by time (1h, 1d, 7d, 1m, 1y).
`language`	string	`en`	Language code for results.
`country`	string	`US`	Country code for results.

Example Output

{
  "title": "Donald Trump says Iran war will end ‘very soon’",
  "link": "https://news.google.com/read/CBMicEFVX3lxTE1ZZTc0Sm9BSXpmc3BkeC...",
  "image": "https://news.google.com/api/attachments/CC8iK0N...",
  "source": "Financial Times",
  "sourceIcon": "https://encrypted-tbn2.gstatic.com/faviconV2?url=https://www.ft.com...",
  "author": "By James Politi, Lauren Fedor & Jamie Smyth",
  "datetime": "2026-03-10T06:07:43Z",
  "time": "6 hours ago",
  "articleType": "regular"
}

Why Choose This Scraper?

Unlike other scrapers that rely on fragile CSS classes, this Actor targets stable functional attributes (jscontroller, data-n-tid) used by Google. It also handles cookie consent dialogs automatically and uses Playwright for a "human-like" interaction that is significantly less likely to be blocked.