
#8392 – Sitemap exclusion rules

Posted in ‘4SEO’
Tuesday, 21 December 2021 05:23 UTC
freelab

 

To avoid duplicates and to keep dynamic pages out of the sitemap, I added these rules for the sitemap.

Exclusions:

/fr/{*}
/en/{*}
/{?}{*}
/{*}{?}{*}
/?own={*}

 

Inclusions:

/categories/{*}

 

I then asked 4SEO to generate a new sitemap, but pages that match those exclusion rules are still listed:

https://[redacted].fm/sitemap.fr-FR.4seo.1.xml.gz

 

 

 

Tuesday, 21 December 2021 08:51 UTC
wb_weeblr

Hi

Yes, you need to "Reset analysis" and re-run it for the new rules to be taken into account. The sitemap "Rebuild Now" will only clear the cache in case it's damaged.

Please note:

- /?own={*} is not needed as you already excluded any page that starts with /? with the previous rule, /{?}{*}

- with /fr/{*} and /en/{*}, aren't you going to just exclude (almost) all the pages on your site?

Best regards

Yannick Gaultier

weeblr.com / @weeblr

 

 
Tuesday, 21 December 2021 10:50 UTC
freelab

 

 

Yes, you need to "Reset analysis" and re-run it for the new rules to be taken into account.

I did try to restart the analysis.

When I launch it, it immediately says that the analysis is finished, without crawling anything.

That is why I tried the sitemap rebuild.

 

The sitemap "Rebuild Now" will only clear the cache in case it's damaged.

Please note:

/?own={*} is not needed as you already excluded any page that starts with /? with the previous rule, /{?}{*}

Sure. I was not 100% sure about your syntax, so I added the rule twice.

 

- with /fr/{*} and /en/{*}, aren't you going to just exclude (almost) all the pages on your site?

/fr/* and /en/* are duplicates of /categories/fr/* and /categories/en/* with a different UI (folders only, or folders + tree).

 

Tuesday, 21 December 2021 11:05 UTC
wb_weeblr

Hi

When I launch it, it immediately says that the analysis is finished, without crawling anything.

This would indicate either:

- you have excluded (from analysis) all the links from your home page

- you are going too fast in the crawling and your server cannot respond.

The screenshots are not really useful; what would matter more are the exclusion rules for both analysis and sitemaps.

Also, I checked your robots.txt and there are some odd things there:

- some image rules were inserted inside of the 4SEO sitemap automatic listing

- the automated sitemap rule is outdated, it still has /index.php/ at the start.

I'd suggest removing the manually added images section (use the robots.txt editor in 4SEO), then also removing the /index.php/ bit at the start of that line so that it reads:

Allow: /sitemap-4seo

Best regards

Yannick Gaultier

weeblr.com / @weeblr

 

 

 
Tuesday, 21 December 2021 15:11 UTC
freelab

I changed the robots.txt.

Here is the complete view of the rules:

 

 

Tuesday, 21 December 2021 15:18 UTC
wb_weeblr

Hi

The rules you created are not for the sitemap. You added them to the pages analysis configuration.

And so this rule /{*}{?}{*} excludes all the pages on your site from analysis, which is why the analysis ends immediately.

I'd suggest:

1 - Delete all the rules you created. You made them complex but they really don't need to be.

2 - Go to the Sitemaps page, open the Settings and, in Included content, enter your exclusion rules for your sitemap:

Note that the specification I entered is different from what you did: you used {?} to specify a ? but {?} actually means "any single character" and not "the ? character". So you were excluding everything.
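
To make the difference concrete, here is a minimal Python sketch of how such wildcard rules behave. It is not 4SEO's actual matching code: it only assumes that {*} matches any run of characters (including none), that {?} matches exactly one character, and that a rule must match the whole path. The function names and the literal rule /?{*} are hypothetical, used for illustration only.

```python
import re

def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a wildcard rule into a regex (assumed semantics, not 4SEO's code)."""
    parts = re.split(r"(\{\*\}|\{\?\})", rule)
    out = []
    for part in parts:
        if part == "{*}":
            out.append(".*")             # any run of characters, possibly empty
        elif part == "{?}":
            out.append(".")              # exactly one character, whatever it is
        else:
            out.append(re.escape(part))  # everything else is matched literally
    return re.compile("".join(out))

def is_excluded(path: str, rules: list[str]) -> bool:
    """Return True if any rule matches the whole path."""
    return any(rule_to_regex(rule).fullmatch(path) for rule in rules)

# {?} is a one-character wildcard, so this rule matches any path
# that has at least one character after the leading slash.
print(is_excluded("/categories/fr/music", ["/{*}{?}{*}"]))  # True
print(is_excluded("/fr/page", ["/{?}{*}"]))                 # True

# A literal ? in the rule only matches paths that start with /?
print(is_excluded("/?own=1", ["/?{*}"]))                    # True
print(is_excluded("/fr/page", ["/?{*}"]))                   # False
```

Under these assumptions, /{*}{?}{*} and /{?}{*} both match practically every URL on the site, which is consistent with the analysis ending immediately.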

Best regards

Yannick Gaultier

weeblr.com / @weeblr

 

 

 
Friday, 21 January 2022 05:34 UTC
system
This ticket has been automatically closed. All tickets which have been inactive for a long time are automatically closed. If you believe that this ticket was closed in error, please contact us.