May
31
Well, the first page went up on my new site on the 10th of May, pages were added on the 12th of May and the site was already ranking for few keywords (technical, not commercial) by the middle of last week. It took about 10 days total, from when content first went up to ranking on a phrase with 145,000 pages returned.
This page, ranking at Google for mod_rewrite htacess
There are, as of now, a total of 18 pages on the site and they are all indexed at Google. One inbound link, though a couple have been added since then. There won’t be a “link campaign” as such, the site is far from ready and there are no commercial purposes, but truthfully I just wanted to see how the Google “sandbox” effect would work with this domain. I assume it’ll be safe from site scrapers, since it isn’t a commercial site, is not intended to be, and has no commercial value.
Several things we can look at regarding the “age” of a site:
1. When was the domain first registered?
2. When did the domain first go online, with or without content and with or without any inks.
3. When did the first link to the site/domain first appear?
4. When did Google first find the site/domain via a link?
5. When did the content go up, and what’s the date for pages in the last-modified headers?
6. TrustRank
And then there’s a matter of TrustRank - which is not something that’s totally divorced from “sandboxing” issues, not by a long shot. TrustRank is not brand new, the first publicly published papers I’ve found on it date back to 2001.
May
29
It’s kind of amusing to sit back and watch one in action.
http://forums.searchenginewatch.com/showthread.php?p=48514#post48514
So far, three and counting. ![]()
May
22
Imbedding a Blog into a Web Page
Filed Under CMS and Blogs | Leave a Comment
I’ve been wondering about doing this and just found that it can be done with the paid version of Live Journal.
It may not be the only way to do it, but it does seem like a nice idea to have a section on the homepage of a site that can be updated as easily as just doing a blog entry.
May
22
When trying out different CMS for Webmaster Woman last summer, none of which is being used, I had to nuke one (Drupal) because it was throwing out Session IDs. It seems that now that’s creating a problem with Yahoo because although they’re usually right on it with picking up new sites and pages and including them in the index, while the new site is getting hit by the crawler, all there is in the index is two “pages” - one of which doesn’t even exist.
I believe it may possibly be a duplicate content issue, since during the time of the test install there was a folder called /search and now Yahoo is showing that with the SessionID (which it hits with the crawler constantly) - with a duplicate of what’s on the current temporary homepage of the site.
I’m not sure if it will make a difference with or withouth the trailing slash for /search/ or /search but I’m putting up another temporary page in a directory I’m creating as
http://www.webmasterwoman.com/search/
That’s returning a 404 but this is also being picked up with a SessionID for the homepage, and returning a 200:
http://www.webmasterwoman.com/?PHPSESSID=7f5131f335eb7c2787a98392a04131d3
I’ll see if it will be picked up with whatever new is put on it for the time being and get rid of that Session ID page being hit daily - which doesn’t even exist, Slurp is just still looking for it.
The site was totally crawled in the past day, par for the course with Yahoo, they’re not asleep on the job. Not that it matters much right now because the site isn’t near ready for a launch, but it’s a challenge to deal with this issue, which I’ve never had to before.