Marieke van de Rakt is the CEO of Yoast. Her main focus is on growing the company. She is really passionate about maintaining open and friendly company culture. Marieke is also heavily involved in the marketing of Yoast and with Yoast SEO Academy: the online courses platform.
Ranking in search engines requires a website with flawless technical SEO and great, relevant content. Luckily, the Yoast SEO plugin takes care of (almost) everything on your WordPress site. Still, if you really want to get the most out of your website and keep outranking the competition, some basic knowledge of technical SEO is a must. In this post, we’ll explain one of the most important concepts of technical SEO: crawlability.
What is the crawler again?
A search engine like Google consists of a crawler, an index, and an algorithm. The crawler follows the links. When Google’s crawler — also known as Googlebot — finds your website, it’ll render it, read it and save the content in the index.
A crawler follows the links on the web. A crawler is also called a robot, a bot, or a spider. It goes around the internet 24/7. Once it comes to a website, it saves the HTML version in a gigantic database called the index. This index is updated every time the crawler comes around your website and finds a new or revised version of it. Depending on how important Google deems your site and the number of changes you make on your website, the crawler comes around more or less often.
Read more: SEO basics: what does Google do »
And what is crawlability?
Crawlability has to do with the possibilities Google has to crawl your website. You can block crawlers on your site. There are a few ways to block a crawler from your website. If your website or a page on your website is blocked, you’re saying to Google’s crawler: “do not come here.” Your site or the respective page won’t turn up in the search results in most of these cases.There are a few things that could prevent Google from crawling (or indexing) your website:
- If your robots.txt file blocks the crawler, Google will not come to your website or specific web page.
- Before crawling your website, the crawler will take a look at the HTTP header of your page. This HTTP header contains a status code. If this status code says that a page doesn’t exist, Google won’t crawl your website. In the module about HTTP headers of our Technical SEO training we’ll tell you all about that.
- If the robots meta tag on a specific page blocks the search engine from indexing that page, Google will crawl that page, but won’t add it to its index.
This flow chart might help you understand the process bots follow when attempting to index a page:
Want to learn all about crawlability?
Although crawlability is just the basics of technical SEO (it has to do with all the things that enable Google to index your site), it’s already pretty advanced stuff for most people. Nevertheless, if you’re blocking – perhaps even without knowing! – crawlers from your site, you’ll never rank high in Google. So, if you’re serious about SEO, this should matter to you.
If you really want to understand all the technical aspects of crawlability, you should check out our Technical SEO training. This SEO course will teach you how to detect technical SEO issues and solve them (with our Yoast SEO plugin). Yoast SEO Academy is included at no extra cost in your Premium subscription
Keep reading: What is technical SEO: 8 aspects everyone should know »
Coming up next!