• Home
  • Get help
  • Ask a question
Last post 1 hour 39 min ago
Posts last week 81
Average response time last week 44 min
All time posts 70355
All time tickets 10859
All time avg. posts per day 20

Helpdesk is open from Monday through Friday CET

Please create an (free) account to post any question in the support area.
Please check the development versions area. Look at the changelog, maybe your specific problem has been resolved already!
All tickets are private and they cannot be viewed by anyone. We have made public only a few tickets that we found helpful, after removing private information from them.

#8131 – Site analysis - does not finish

Posted in ‘4SEO’
This is a public ticket. Everybody will be able to see its contents. Do not include usernames, passwords or any other sensitive information.
Tuesday, 07 September 2021 09:09 UTC
smirrederfuchs-gmail-com

hi.

i like your tool very much.
I'm looking forward to supporting more 3d party components like the stackideas components or jReviews as example.

right now i'm wondering about the site analysis feature. i have installed 4SEO on two live websites over a week. the process as it looks does never finish - should that be so?

https://www.screencast.com/t/WV7asxo8jZ
the 23 pending pages are never get finished as it looks like to me, or am I misunderstanding something here?

Tuesday, 07 September 2021 09:14 UTC
wb_weeblr

Hi

the process as it looks does never finish - should that be so?

No, it should finish and then keep going in the background, looking for search.

That said, it all depends on the specific of your site and maybe its internal linking for instance. Some pages may need to be excluded from crawling manually sometimes.

1 - What are the sites real and full URL (please always provide that)

2 - How long did you wait?

There might be some errors happening during the crawl. A first thing to do would be to install our 4LOGS plugin (get it from the JED) directly. This small plugin will let you view and manage all log files on your Joomla sites, including those from 4SEO. 

Looking at any "error" log files can provide insight into what's happening. If that does not help, there are other checks we can do later on.

Best regards

Yannick Gaultier

weeblr.com / @weeblr

 

 
Tuesday, 07 September 2021 09:46 UTC
smirrederfuchs-gmail-com

1 - What are the sites real and full URL (please always provide that)

https://[redacted].world/

2 - How long did you wait?

over a week right now.

i have attached the latest two error log files from " /forseo/errors". I'm not a expert here, maybe you find some information about what is happening.

Tuesday, 07 September 2021 09:47 UTC
wb_weeblr

Hi

Nothing was attached, which is expected as these are PHP files. I'd suggest providing superadmin credentials instead so that I can look directly into it.

Best regards

Yannick Gaultier

weeblr.com / @weeblr

 

 
Tuesday, 07 September 2021 09:57 UTC
smirrederfuchs-gmail-com

yes as it looks the filetype ".php" is not supported to upload.

Joomla Super User: 
[redacted] / [redacted]

Backend LINK: 
https://[redacted].world/administrator/[redacted]

htaccess: 
[redacted] / [redacted]

Tuesday, 07 September 2021 09:59 UTC
wb_weeblr

Hi

The .htaccess creds do not seem to work!

Best regards

Yannick Gaultier

weeblr.com / @weeblr

 

 
Tuesday, 07 September 2021 10:08 UTC
smirrederfuchs-gmail-com

i readded the htaccess login data, please try again.

Tuesday, 07 September 2021 10:24 UTC
wb_weeblr

Hi

Looking into this now, I'll get back to you when I have more information. There are a number of errors recorded, from 4SEO and mostly from JReviews. I need to investigate what's causing what.

Best regards

Yannick Gaultier

weeblr.com / @weeblr

 

 
Tuesday, 07 September 2021 10:34 UTC
wb_weeblr

Hi Again

Actually I am restarting the analysis from scratch using the latest dev version. Can you hold on doing anything with 4SEO?

Best regards

Yannick Gaultier

weeblr.com / @weeblr

 

 
Tuesday, 07 September 2021 10:36 UTC
smirrederfuchs-gmail-com

I understand, i have deactivated the cronjob to 4SEO too.

Tuesday, 07 September 2021 11:12 UTC
wb_weeblr

Hi again,

OK, from what I see this is coming from your events page. It has a calendar and links that always go to the "next day" and "previous" days and so it's kinda of an infinite crawl: 4SEO loads the events page, which is set to today, it finds a link to tomorrow, analyzes that and finds a link to the next day and so on.

And so the analysis can never end because there will always be a tomorrow!

This is a problem for 4SEO but more importantly it's a problem for Google. They probably already stopped crawling your site due to that or if they spend crawl budget on this, they are not crawling other pages of the site.

Most calendar extensions exclude calendar pages from search engines crawling by adding them to robots.txt or adding a nofollow attribute to all calendar links.

If you do that, then 4SEO will automatically pick these up and stop trying to crawl your calendar.

Note that this is something that should also be discussed with StackIdeas, they should not be doing that but instead properly exclude calendar links.

For a current workaround, I suggest excluding 4SEO from crawling pages with an address that include a date (under Pages | Settings | Site analysis):

I have just done that and restarted an analysis, which I'll let run until it finishes.

Best regards

Yannick Gaultier

weeblr.com / @weeblr

 

 

 
Tuesday, 07 September 2021 11:27 UTC
smirrederfuchs-gmail-com

OK i understand, thank you meanwhile for the workaround!
I will consult with stackideas about there calendar implantation.

Tuesday, 07 September 2021 11:51 UTC
wb_weeblr

Hi

Looks like this was the problem, the analysis completed in a half-hour of so with those pages excluded.

I will consult with stackideas about there calendar implantation

Yes, on your site, the actual events will/should be accessible to search engines from the "Events" page (https://[redacted].world/events).

This page does not have infinite links itself, except for the calendar module. That's why links on the calendar should be nofollow-ed so that search engines don't give up on you. Likewise for the Next/previous links displayed at the top of the page when on /events/date.

I'll look into excluding these automatically when using Jomsocial (or excluding JomSocial entirely, this is not really search engines content actually) but this might be a bit difficult to do as URLs are not the same from one site to another.

Best regards

Yannick Gaultier

weeblr.com / @weeblr

 

 
Tuesday, 07 September 2021 11:55 UTC
smirrederfuchs-gmail-com

 

Looks like this was the problem, the analysis completed in a half-hour of so with those pages excluded.

Perfect, thank you for your support!

I'll look into excluding these automatically when using Jomsocial (or excluding JomSocial entirely, this is not really search engines content actually) but this might be a bit difficult to do as URLs are not the same from one site to another.

it is about "Easysocial" from stackideas :)

Tuesday, 07 September 2021 11:56 UTC
wb_weeblr

Hi

it is about "Easysocial" from stackideas :)

Yes, I know, fingers write different from brain :)

Best regards

Yannick Gaultier

weeblr.com / @weeblr

 

 
Friday, 08 October 2021 05:34 UTC
system
This ticket has been automatically closed. All tickets which have been inactive for a long time are automatically closed. If you believe that this ticket was closed in error, please contact us.
This ticket is closed, therefore read-only. You can no longer reply to it. If you need to provide more information, please open a new ticket and mention this ticket's number.