<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=us-ascii">
<meta name=Generator content="Microsoft Word 12 (filtered medium)">
<title>Message</title>
<style>
<!--
/* Font Definitions */
@font-face
        {font-family:Calibri;
        panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
        {font-family:Tahoma;
        panose-1:2 11 6 4 3 5 4 4 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
        {margin:0in;
        margin-bottom:.0001pt;
        font-size:12.0pt;
        font-family:"Times New Roman","serif";}
a:link, span.MsoHyperlink
        {mso-style-priority:99;
        color:blue;
        text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
        {mso-style-priority:99;
        color:purple;
        text-decoration:underline;}
p
        {mso-style-priority:99;
        mso-margin-top-alt:auto;
        margin-right:0in;
        mso-margin-bottom-alt:auto;
        margin-left:0in;
        font-size:12.0pt;
        font-family:"Times New Roman","serif";}
span.EmailStyle18
        {mso-style-type:personal;
        font-family:"Calibri","sans-serif";
        color:#1F497D;}
span.EmailStyle19
        {mso-style-type:personal-reply;
        font-family:"Calibri","sans-serif";
        color:#1F497D;}
.MsoChpDefault
        {mso-style-type:export-only;
        font-size:10.0pt;}
@page Section1
        {size:8.5in 11.0in;
        margin:1.0in 1.0in 1.0in 1.0in;}
div.Section1
        {page:Section1;}
-->
</style>
<!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
</head>
<body lang=EN-US link=blue vlink=purple>
<div class=Section1>
<p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";
color:#1F497D'>Hi Tom,<o:p></o:p></span></p>
<p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";
color:#1F497D'><o:p> </o:p></span></p>
<p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";
color:#1F497D'>You mentioned a couple of all-or-nothing approaches. If we
strip out message headers the information can be archived, searched and used
without compromising the anonymity of the list members. If the list archive is
a simple text file this should be fairly trivial to accomplish if the list
wills it so. Is there a way our moderator can take a straw poll?<o:p></o:p></span></p>
<p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";
color:#1F497D'><o:p> </o:p></span></p>
<p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";
color:#1F497D'>Matt<o:p></o:p></span></p>
<p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";
color:#1F497D'><o:p> </o:p></span></p>
<div style='border:none;border-left:solid blue 1.5pt;padding:0in 0in 0in 4.0pt'>
<div>
<div style='border:none;border-top:solid #B5C4DF 1.0pt;padding:3.0pt 0in 0in 0in'>
<p class=MsoNormal><b><span style='font-size:10.0pt;font-family:"Tahoma","sans-serif"'>From:</span></b><span
style='font-size:10.0pt;font-family:"Tahoma","sans-serif"'>
sgvlug-bounces@sgvlug.net [mailto:sgvlug-bounces@sgvlug.net] <b>On Behalf Of </b>Emerson,
Tom (*IC)<br>
<b>Sent:</b> Tuesday, March 25, 2008 7:34 PM<br>
<b>To:</b> SGVLUG Discussion List.<br>
<b>Subject:</b> [SGVLUG] Robots.txt (was: Paging Greg Stark...)<o:p></o:p></span></p>
</div>
</div>
<p class=MsoNormal><o:p> </o:p></p>
<div>
<p class=MsoNormal><span style='font-size:10.0pt;font-family:"Tahoma","sans-serif";
color:blue'>-----Original Message-----<b> </b>Matt Campbell</span><span
style='font-family:"Courier New";color:blue'><o:p></o:p></span></p>
</div>
<blockquote style='border:none;border-left:solid blue 1.5pt;padding:0in 0in 0in 4.0pt;
margin-left:3.75pt;margin-top:5.0pt;margin-right:0in;margin-bottom:5.0pt'>
<p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri","sans-serif";
color:#1F497D'>What’s involved in writing a robot to strip out the headers for
all the messages in our archive? That way it would be less invasive to
have everything available through Google.</span><span style='color:blue'><o:p></o:p></span></p>
</blockquote>
<div>
<p class=MsoNormal><span style='font-family:"Courier New";color:blue'>it is not
a robot on our side, but rather instructions to /Google's/ robot (or Yahoo's,
Altavista's, or any of the gazillion search engines out there) Basically,
it is a simple text file that lists the directories that are "off
limits" to web-spiders or "robots". It is placed in a
known/common location, and all "robots" are /supposed/ to abide by
it.</span><o:p></o:p></p>
</div>
<div>
<p class=MsoNormal> <o:p></o:p></p>
</div>
<div>
<p class=MsoNormal><span style='font-family:"Courier New";color:blue'>As far as
"should our e-mail archive be indexed by the big guys?", I know there
are campers on both sides of this issue, and I'm generally on the
"pro" indexing side of the fence for a simple reason (or two) -- if
someone solves a particularly involved Linux problem on the list, the next
person with that same or similar problem WON'T find the answer if
they aren't a member of our group/list (and even if they are, they have to
THINK about searching our archives in the first place)</span><o:p></o:p></p>
</div>
<div>
<p class=MsoNormal> <o:p></o:p></p>
</div>
<div>
<p class=MsoNormal><span style='font-family:"Courier New";color:blue'>(the
secondary reason is that it increases exposure of our group in particular --
take your case as a prime example: if you found a suitable solution to your
hard drive problems solely by searching "the net" and finding our
archive AND seeing that we were "local", chances are you would
consider stopping in for a meeting or two, right?)</span><o:p></o:p></p>
</div>
<div>
<p class=MsoNormal> <o:p></o:p></p>
</div>
<div>
<p class=MsoNormal><span style='font-family:"Courier New";color:blue'>On the
"anti" side are folks worried about how they may appear to the rest
of the world should one of their sgvlug posts appear in wider circulation than
just this list (ummm... "shouldn't have posted it in the first place"
is usually the counter argument, but even really good things can be
taken "out of context" and seem rather disparaging...)</span><o:p></o:p></p>
</div>
<div>
<p class=MsoNormal> <o:p></o:p></p>
</div>
<div>
<p class=MsoNormal><span style='font-family:"Courier New";color:blue'>Then
there are a few that actively protect their anonymity (sp?) while online, and a
global (or even local) index kind of defeats that purpose (for that,
there is the "x-no-archive" header you can apply to your e-mail
client -- instructions for such are on our site -- but that doesn't stop manual
archiving by packrats like me ;) )</span><o:p></o:p></p>
</div>
<div>
<p class=MsoNormal> <o:p></o:p></p>
</div>
<div>
<p class=MsoNormal> <o:p></o:p></p>
</div>
</div>
</div>
</body>
</html>