Sitemap and Crawl

I just got a message from Google at Webmaster Tools saying:

Dear owner or webmaster of http://erotic-community.com/
While crawling your site, we have noticed an increase in the number of transient soft 404 errors around 2010-10-09 15:00 UTC (London, Dublin, Edinburgh). Your site may have experienced outages. These issues may have been resolved. Here are some sample pages that resulted in soft 404 errors:
http://erotic-community.com/m/sites/calendar/2008/10
http://erotic-community.com/blogs/entry/Membership
http://erotic-community.com/blogs/entry/Marry-Christmas
http://erotic-community.com/m/articles/calendar/2008/10
http://erotic-community.com/blogs/entry/Embed-

--------------------------------------------------------------

This is probably not good. Is there a way to get rid of such pages?
on another site which crawls my site there I get "Warning: we found 219 errors on your site. Click on Errors to see the..." and the site is just craweled 54%

Same when I generate sitemaps it crawls calender from 1409 and so on. My stand alone sitemap generator crawled only 50000 pages but so many with that calendar stuff.

Hundreds of pages are crawled like:

http://erotic-community.com/m/articles/archive/privacy.php

http://erotic-community.com/m/feedback/index/m/feedback/index/

But from the 1500 profiles I have on my site only a few been crawled.

Does anyone now how to fix this issue?

Diddy is not greedy and has time. Dolphin is cool and its not just mine :-)
Quote · 16 Oct 2010
ok I got all of that calendar stuff out of the crawl. but from my 1500 profiles I have there are only about 7 profiles crawled and if I filter out http://erotic-community.com/m/ then no profile is crawled at all. Does no one here know why?
Diddy is not greedy and has time. Dolphin is cool and its not just mine :-)
Quote · 16 Oct 2010

It wouldnt be because the profiles are private to guests, would it? I know on my site that the profile that are private, they dont get crawled at all.

Quote · 17 Oct 2010

So, are you saying that if a site is "members only," no profiles will be crawled?

Someday, Someway.
Quote · 17 Oct 2010

 

So, are you saying that if a site is "members only," no profiles will be crawled?

I'd imagine a members-only site wouldn't have too much data crawled.

BoonEx Certified Host: Zarconia.net - Fully Supported Shared and Dedicated for Dolphin
Quote · 17 Oct 2010

I guess that's the price of privacy.  Keeping out the riffraff keeps out Google, too.  But how does Facebook do it?  Access requires login.  How do the search bots get through it?

Someday, Someway.
Quote · 17 Oct 2010

Thanks for the info!

I did fix it the following way :-)

I made "view profiles" for non-members visible and went then to dolphin admin builders => Page Block and made there only certain blocks visible for guests and add a block for guests telling them to join to see the entire profile.

By disallow http://erotic-community.com/m/poll/calendar/ for each section like poll, sites, news, articles and so on it will not be crawled anymore with my GSiteCrawler.

My robot.txt looks like shown below and any suggestions a very welcome for user agent:

User-agent: *
Disallow: /administration/
Disallow: /inc/
Disallow: /langs/
Disallow: /xml/
Disallow: /../../calendar/

Diddy is not greedy and has time. Dolphin is cool and its not just mine :-)
Quote · 17 Oct 2010
 
 
Below is the legacy version of the Boonex site, maintained for Dolphin.Pro 7.x support.
The new Dolphin solution is powered by UNA Community Management System.