Search Engine

James Drabb JDrabb at darden.com
Tue Aug 6 14:00:33 PDT 2002


> -----Original Message-----
> From: Gerard Beekmans [mailto:gerard at linuxfromscratch.org]
> Sent: Tuesday, August 06, 2002 4:33 PM
> To: blfs-support at linuxfromscratch.org
> Subject: Re: Search Engine
> 
> 
> On August 5, 2002 09:12 pm, you wrote:
> > Another thing I just realized is I don't think a spider type
> > of searc engine will work.  The web site dynamically generates
> 
> That makes things a bit more complicated. The thing is, most 
> search engines 
> put their stuff in a database so they need to spider a 
> webpage in order to 
> index it and be able to return it as a result. If you only 
> have dynamic 
> pages, then it's going to be really hard. You'd have to have 
> different search 
> engine databases for different user so every user get his own 
> set of results 
> with the proper links on it. Search engines only work for 
> static pages best 
> that don't need login info. Think of it this way: if your pages are 
> dynamically generated upon login, you'd never have any page 
> with the same 
> info, so a search engine isn't going to be useful since it 
> can't index 
> anytghing that doesnt' exist until somebody logs in.
> 
> 
> -- 
> Gerard Beekmans
> www.linuxfromscratch.org
> 
> -*- If Linux doesn't have the solution, you have the wrong problem -*-
> -- 
> Unsubscribe: send email to listar at linuxfromscratch.org
> and put 'unsubscribe blfs-support' in the subject header of 
> the message
> 

Thanks Gerard,

The only info I am concerned with is the static text within the php pages.
For example here is a snippet of code:

<?
	header("Expires: Mon, 26 Jul 1997 05:00:00 GMT");               // Date in the past
	header("Last-Modified: " . gmdate("D, d M Y H:i:s") . " GMT");  // always modified
	header("Cache-Control: no-store, no-cache, must-revalidate");   // HTTP/1.1
	header("Cache-Control: post-check=0, pre-check=0", false);      // HTTP/1.1
	header("Pragma: no-cache");                                     // HTTP/1.0
	
	require_once("inc_global.php");
	require_once("inc_menu.php");
?>
<html>

<head>
	<title>Dell Computer Purchasing Program</title>
</head>

<body topmargin="0" leftmargin="0" rightmargin="0" bottommargin="0" marginwidth="0" marginheight="0" class="TA">
<? subWriteMenu(BenefitsTab); ?>
<center>
<table width="80%" cellspacing="0" cellpadding="0" border="0">
	<tr>
		<td width="22"><img SRC="/company/<?=$strCompany;?>/images/boxtop_left.gif"></td>
		<td width="100%" height="28" background="/company/<?=$strCompany;?>/images/boxborder.gif">
			<font class="TabIn">DELL Computer Purchase Program</font>
		</td>
		<td width="22"><img SRC="/company/<?=$strCompany;?>/images/boxtop_right.gif"></td>	
	</tr>
	<tr>
		<td width="22" background="/company/<?=$strCompany;?>/images/boxborder_left_white.gif">
			 
		</td>
		<td align="center" bgcolor="#ffffff"><br>
			<table border="0" cellpadding="0" cellspacing="0" width="100%">
			  <tr>
			    <td class="BodyText">
			      <img align="right" border="0" height="32"
				  src="/common/images/delllogo.gif" width="100" valign="top">
			      All Support Center and restaurant <?=$strEmpWordL;?>s are eligible for the Dell
				<%=strEmpWordU%> Computer Purchase Program, EPP. The plan provides a 10%
				discount on select products. Orders are processed with your name and credit.
				You place the order, process the payment and handle any additional
				requirements. It's also your responsibility to handle any and all
				service requirements directly with Dell.</p>

So if a user comes along and searches for EPP or DELL etc this page would be a hit and
show in the results.

Any suggestions for something like this?  Right now the site is ASP on Win2k and I am
converting it to LAMP : )  For win2k I use the M$ Indexing server and it works well to
just index the contents of the files under the website.

Jim Drabb
-- 
-------------------------------------------------------------
Never ask a geek why, just nod your head and slowly back away
-------------------------------------------------------------
James Drabb JR
Programmer Analyst
Darden Restaurants
Business Systems
JDrabb at Darden.com
-- 
Unsubscribe: send email to listar at linuxfromscratch.org
and put 'unsubscribe blfs-support' in the subject header of the message



More information about the blfs-support mailing list