Support #8122
closedMeta data robots twice
0%
Description
Hello,
I have found the 2 topic abouts Metadata header tags.
https://redmine.webtoolkit.eu/boards/2/topics/16644
https://redmine.webtoolkit.eu/boards/2/topics/15203
I have set the meta tag for robots to "index, follow" in the wt_config.xml, but I always get a second meta tag in my header
<meta name="robots" content="noindex, nofollow" />
I get this as final :
<head>
<meta name="robots" content="all"/>
<meta name="description" lang="fr" content="Forestimator: outils cartographiques pour la gestion des forĂȘts Wallonnes">
<meta name="viewport" content="width=device-width, initial-scale=1">
<meta name="robots" content="noindex, nofollow">
How could I get rid of it ?
Many thanks
Updated by Korneel Dumon over 3 years ago
- Status changed from New to Resolved
As the second link says, Wt will only add this second header when it does not recognize your browser as a bot. This can be a bit tricky to test, try something like this:
curl -H "User-Agent: Googlebot/2.1" http://localhost:8080
The list of bots for which this works is also in wt_config (under <user-agents type="bot">
)
Updated by Samuel Quevauvillers over 3 years ago
Ok thank you but there is still something I don't understand. How can I allow bots to index my site ?
I notice with the the curl cmd that if I run it as bot or not, I always get a <meta name="robots" content="noindex, nofollow">
Updated by Samuel Quevauvillers over 3 years ago
Ok now this is working, I had to set back the bot list in my wt_config.xml :
<user-agents type="bot">
<user-agent>.*Googlebot.*</user-agent>
<user-agent>.*msnbot.*</user-agent>
<user-agent>.*Slurp.*</user-agent>
<user-agent>.*Crawler.*</user-agent>
<user-agent>.*Bot.*</user-agent>
<user-agent>.*ia_archiver.*</user-agent>
<user-agent>.*Twiceler.*</user-agent>
<user-agent>.*Yandex.*</user-agent>
<user-agent>.*Nutch.*</user-agent>
<user-agent>.*MJ12bot.*</user-agent>
<user-agent>.*Baiduspider.*</user-agent>
<user-agent>.*Ezooms.*</user-agent>
<user-agent>.*Sogou web spider.*</user-agent>
<user-agent>.*AhrefsBot.*</user-agent>
</user-agents>
Updated by Roel Standaert over 3 years ago
- Status changed from Resolved to Closed