The set up is using Oracle 9ias - apache front end, oc4j back end. From what
we have been able to discern, it is the actual header. The site uses the
Struts framework, so the actual urls submitted would be for example
(http://mysite/home.do), where ".do" is just a servlet mapping. When we have
tried to follow the one link that has been spidered, it actually contains
the full path that appears in the "Content-Location" header (i.e.
http://mysite/web-inf/jsps/bogus.jsp) which in this case can't even be
accessed. The feedback we get from third-party utilities that try to spider
the site is that it stopping because it has already indexed "bogus.jsp",
which in reality will always appear since it is template, where the actual
urls will be different as is above. This is where we've come to the
conclusion that if we could unset or at least alter what is being placed in
that header, we might have more luck.
> > Is there a way to not display, or replace the "Content-Location" http
> > header? - in short, I have an app that is using templating, and because
> > the "Content-Location" header returning the same page name, even though
> > content is different, search engine spiders won't index any further than
> > first page.
> > Any help would be greatly appreciated
> Can you provide more details about the setup or perhaps an example URL.
> Are you sure it is the Content-Location that is the problem, or could
> it simply be that the search engines do not "deep-index" pages that
> look dynamic?