I have hundreds of "rich" (doc/rtf/ppt/xls/pdf) and html documents.
using a special tool, "rich" documents are marked by us and html documents
are parsed and stripped (to textual content, without tags or code).
After the tool has finished, we plan to insert each file content into the
I'm wondering what is the best database design for this?
should I use two columns (text and image) to store textual (parsed) content
and the rich files seperatlly? or can I use a text field for the "rich"
files as well?
I have no plan to use ms-sql full text search, just store the "rich" files
AS IS in the database for further processing.