Question About GSC Coverage Report - Excluded URLs

Roger Marquez

I am currently working on a technical audit for a client's site and it has 347 URLs under the status "Discovered - currently not indexed".

I checked and all these URLs have two trailing slashes at the end... Weird, eh?

But that's not why I am opening this thread. I just crawled these 347 URLs with SF and they all 301 redirect to the same page without the two trailing slashes.

Everything seems to be OK in this regard, so if that's the case, how can I remove those 347 URLs from the Excluded section?

The client is a bit paranoid about this and sees the URLs under the "Excluded" section as "Errors".

Is there a way to remove those URLs and make my client a happy man?

Cheers!
 
Ask Google to crawl the site again :D
Discovered - currently not indexed: The page was found by Google, but not crawled yet. Typically, Google wanted to crawl the URL but this was expected to overload the site; therefore Google rescheduled the crawl. This is why the last crawl date is empty on the report.
 
Check whether you have double slashes in your database. Most likely you do, which is why GSC is picking them up.

You can run a SQL query, or use WP-CLI or something similar, to check.

A SQL query to fix them could be something like this (back up the database first):
Code:
-- Replace domain.com with the real domain before running.
-- Fix the site address options.
UPDATE wp_options
SET option_value = REPLACE(option_value, 'https://domain.com//', 'https://domain.com/')
WHERE option_name IN ('home', 'siteurl');

-- Fix post GUIDs.
UPDATE wp_posts
SET guid = REPLACE(guid, 'https://domain.com//', 'https://domain.com/');

-- Fix links inside post content.
UPDATE wp_posts
SET post_content = REPLACE(post_content, 'https://domain.com//', 'https://domain.com/');

-- Fix post meta. Serialized values can break if a string's length changes.
UPDATE wp_postmeta
SET meta_value = REPLACE(meta_value, 'https://domain.com//', 'https://domain.com/');
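
If you only want to check first, without changing anything, a read-only query along these lines should show whether the double slashes are actually stored in the database. This is just a sketch: it assumes the default wp_ table prefix, and domain.com is a placeholder for the real domain.

```sql
-- Read-only checks: nothing here modifies data.

-- Show the stored site addresses.
SELECT option_name, option_value
FROM wp_options
WHERE option_name IN ('home', 'siteurl');

-- Count posts whose content contains the double-slash URL.
SELECT COUNT(*) AS posts_with_double_slash
FROM wp_posts
WHERE post_content LIKE '%https://domain.com//%';

-- Count meta rows that contain the double-slash URL.
SELECT COUNT(*) AS meta_with_double_slash
FROM wp_postmeta
WHERE meta_value LIKE '%https://domain.com//%';
```

If both counts come back as 0, the double-slash URLs are probably coming from somewhere else (a template, plugin, or external links), and running UPDATE statements would not change anything.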
 