This paper examines how the concepts of dimensional data warehouses can be applied to document retrieval and storage. It then shows how the specifics of dimensional document warehouses differ from dimensional data warehouse and how these differences make it impractical to use existing engines for building and analyzing data cubes (such as SQL Server?s Analysis Manager) in order to build and analyze a document warehouse. The paper further shows that readily available software can be used to build an engine to analyze a dimensional document mart. All of the steps required to design, build, and analyze a dimensional document mart are described and illustrated. Design features are suggested for improving the recall and precision of searches from dimensional document marts.
Citation:
Gregory Schymik, Karen Corral, David Schuff, Robert St. Louis, "Architecting a Dimensional Document Warehouse," hicss, pp.216a, 40th Annual Hawaii International Conference on System Sciences (HICSS'07), 2007