shellsharks,

I’m experiencing a Google / robots.txt issue of some kind. According to my Google Search Console, my site (shellsharks.com) is crawlable and indexable, but it is blocked because of my robots.txt file. When inspecting the robots.txt file WITH the Google search console it says everything is fine and I should not be blocked. Up until recently, I had a “noindex” meta tag in my HTML markup but have since deleted that line so that shouldnt be an issue any more. Anyone have any ideas why Google still believes my site is blocked by robots.txt? Is this just a wait until Google can organically crawl my site again after having deleted the noindex meta tag?

Appreciate any help! Right now nothing on my site is findable it seems via Google which is kinda a bummer =/.

#seo #google #fedihelp #webdev #webdevelopment #webdeveloper

dwsmart,
@dwsmart@seocommunity.social avatar

@shellsharks

Your robots.txt starts with this

User-agent: *
Sitemap: https://shellsharks.com/sitemap.xml

User-agent: GPTBot
Disallow: /

Because there is no directive for user-agent: * , just the sitemap entry, this is parsed as

User-agent: *
User-agent: GPTBot
Disallow: /

Or for everything else and GPTBot, block everything

Changing this to

User-agent: *
Disallow:
Sitemap: https://shellsharks.com/sitemap.xml

User-agent: GPTBot
Disallow: /

Will solve it.

shellsharks,

@dwsmart This seems to be the issue. Thanks so much!!!

dwsmart,
@dwsmart@seocommunity.social avatar

@shellsharks glad that sorted it!

shellsharks,

@dwsmart Hey! So the new robots.txt file seems to have made my site crawlable+indexable but for some reason I can’t get Google to just crawl / index my entire site. Even after seemingly successfully submitting my site map via the search console tool. Any ideas on why things might be failing? Thanks in advance!

dwsmart,
@dwsmart@seocommunity.social avatar

@shellsharks there can be many reasons, the main one perhaps being Google just doesn't tend to index everything on a site, & is definitely less eager to do so than in the past

Couple of things:

  • use the URL inspection tool in search console (1st step, not live test), what status do they have?
  • Do the pages that aren't indexed have good internal links for other pages on your site?
  • Do the pages that aren't getting index offer content that might be useful to searchers?
shellsharks,

@dwsmart I actually dug in a bit more to see what was not indexed and it seems it’s limited to only two areas of my site

Which kinda makes sense I think, though my site map does include both these post types (e.g. “notes” and “logs”). Looks like in the site map they are “discovered” but not crawled.

Both of these post types have good linkage and I think would be considered “useful” fwiw. Thanks!

  • All
  • Subscribed
  • Moderated
  • Favorites
  • SEO
  • Durango
  • DreamBathrooms
  • khanakhh
  • GTA5RPClips
  • osvaldo12
  • magazineikmin
  • mdbf
  • InstantRegret
  • rosin
  • Youngstown
  • slotface
  • everett
  • kavyap
  • ngwrru68w68
  • megavids
  • modclub
  • tester
  • tacticalgear
  • cubers
  • thenastyranch
  • cisconetworking
  • ethstaker
  • Leos
  • provamag3
  • normalnudes
  • anitta
  • JUstTest
  • lostlight
  • All magazines