<< Lucene eLecture [47/117] >>

Lucene eLecture


Home / Projects / Lucene eLecture

Way

Lucene eLecture



Development of a Searchframework for the eLecture portal (http://electures.informatik.uni-freiburg.de/catalog/courses.do) of the Institute for computer science (http://www.informatik.uni-freiburg.de/) at the Albert Ludwig University Freiburg (http://www.uni-freiburg.de/).
Based on Lucene (http://lucene.apache.org/) (of Jakarta (http://jakarta.apache.org/)) a Searchframework was to be provided.
The volume of data which can be indicated amounts approximately altogether 360GB and essentially consists of the following file formats:
- PDF (portable document format)
- PPT (Microsoft Office PowerPoint)
- LPD (Lecturnity recordings)
- LPD with video (Lecturnity recordings with video)
- Flash (Macromedia
- AVI (Audio-Video-Interleave)

For the indexing of these formats appropriate parsers researched and applied like e.g. PDFBox (http://www.pdfbox.org/) (PDF) and jakarta POI (http://jakarta.apache.org/poi/hslf/index.html) (PPT). For indexing AVI-files an already present index of the search-engine AVISearch (http://ira.informatik.uni-freiburg.de/cgi-bin/avisearch/avisearch.cgi) used.
The finished search-engine is reachable on http://electures.informatik.uni-freiburg.de/search/init.do (http://electures.informatik.uni-freiburg.de/search/init.do).


Conception
- Wolfgang Hürst (http://ad.informatik.uni-freiburg.de/~huerst/
- Stephan Trahasch (http://ad.informatik.uni-freiburg.de/~trahasch/
Developer
- Markus Krebs 
- Hua Zhang (struts


Details

Language(s)JAVA
TechnologiesJSP, Servlets, Struts, mySQL, Lucene
Tasks to solveTraining in Lucene, research in nutch (http://lucene.apache.org/nutch/) and Red Piranha (http://red-piranha.sourceforge.net/), indexing of PDF, PPT, LPD (Lecturnity) and AVI, creation of a crawler for the file-server, actualization of the index with new data, creation of a front-end in JSP on Apache Tomcat (http://jakarta.apache.org/tomcat/), miscellaneous database-queries, database-access over Tomcat-DataSource
Statefinished and publicly available
Creation time area15.07.2005 to 08.02.2006

Visitors PageClicks Valid XHTML 1.0! Valid CSS!

CanciAbout meSite-MapRightsContactJSWins (JavaScript-Desktop-System)© 2004-2013 by Markus Krebs