Home | Contact Us | FAQ | Search & Site Map | Link to Us
Sign In | Join | Other 45 Sites in Network
HomeAnnouncementsFree MagazinesWhite PapersSubmit Content
Discussion GroupsASP.NETWindows FormsLanguages.NET FrameworkVisual Studio.NET
Articles.NET FrameworkASP.NETToolsWindows Forms
.NET DirectoryOpen Source ProjectsUser GroupsWeb Resources
Related Topics
Visual Basic 6SQL ServerMS AccessOther DB ProductsMS Server ProductsMore Topics ...

.NET Forum / ASP.NET / General / July 2007

Tip: Looking for answers? Try searching our database.

/robots.txt at end of URL?

Thread view: 
Enable EMail Alerts  Start New Thread
Thread rating: 
Les Caudle - 20 Jul 2007 15:22 GMT
I'm noticing that web requrests are coming in with /robots.txt appended at the
end:

http://www.domain.com/ProductDetails.aspx?productID=527/robots.txt

I can correct these, one by one for each page, but I'd like to find a way to
have ASP.NET 2.,0 strip this invalid /robots.txt off the end of any URL for me.

Is this possible?
--
Thanks in advance, Les Caudle
George Ter-Saakov - 20 Jul 2007 15:28 GMT
Let me asking why do you want to strip it?
robors.txt is asked by automatic robots that collect information. Like
google or yaho.

But in your case apparently it's some lame robot that can not parse out url
correctly simply ads /robot.txt to the end of url and asks for it from the
server.
So why would you worry what kind of garbage it will get in return from your
page.
All important robots you should be worring about like Google or Yahoo parse
out url correctly.

So i am saying just ignore it. It's not human.

George.

> I'm noticing that web requrests are coming in with /robots.txt appended at
> the
[quoted text clipped - 10 lines]
> --
> Thanks in advance, Les Caudle
Les Caudle - 20 Jul 2007 17:32 GMT
Well, it creates an event in my event log that would distract me from real
events caused by users who had problems.

Would be nice to be able to globally deal with it.

Regards, Les Caudle

>Let me asking why do you want to strip it?
>robors.txt is asked by automatic robots that collect information. Like
[quoted text clipped - 26 lines]
>> --
>> Thanks in advance, Les Caudle
George Ter-Saakov - 20 Jul 2007 18:21 GMT
Well, the problem is that this time it's robot.txt. Next time (with another
bad robot) it will be something else.
You can not fix it for every bad robot that out there.
Override Application_OnError and send an email to yourself every time
something bad happened. Then you can use Email's rules to filter out the
most annoying ones

George.

> Well, it creates an event in my event log that would distract me from real
> events caused by users who had problems.
[quoted text clipped - 42 lines]
>>> --
>>> Thanks in advance, Les Caudle
Walter Wang [MSFT] - 22 Jul 2007 02:14 GMT
Hi Les,

I agree with George, as long as your web site can be accessed publicly, you
cannot guarantee every URL request is valid or in expected manner.

Regards,
Walter Wang (wawang@online.microsoft.com, remove 'online.')
Microsoft Online Community Support

==================================================
When responding to posts, please "Reply to Group" via your newsreader so
that others may learn and benefit from your issue.
==================================================

This posting is provided "AS IS" with no warranties, and confers no rights.

Free Magazines

Get these publications absolutely FREE for up to 12 months. There are no hidden fees and no obligation. Simply choose a title, complete the application form and submit it. Read more ...

Oracle MagazineNetwork ComputingComputer WorldBio-IT WorldeWeekInformation WeekInfosecurity
 
Sign In
Join
My Latest Posts
My Monitored Threads
My Blog
My Photo Gallery
My Profile
My Homepage

Start New Thread
Enable EMail Alerts
Rate this Thread



©2008 Advenet LLC   Privacy Policy - Terms of Use
This website includes both content owned or controlled by Advenet as well as content owned or controlled by third parties.