search - Lucene.net with large files -
i wondering if possible index , store number of large files each on 4gb in size. had no problems doing , searching documents 1 exception - not able retrieve , highlight content of matched documents. code below allows me create searchable index without running out of memory.
var doc = new document(); doc.add(new field(string, textreader)); changing line below result in out of memory exception.
new field(string, textreader.readtoend(), field.store.yes, field.index.analyzed) i able index , store 28 files 150mb each allowed me search , retrieve matched text. however, query performance unacceptable , after 2 or 3 searches out of memory exception thrown. understand reason exception , why occurs. question community missing something? there functionality within lucene api addresses problem? have solution splits files , achieve without having scale application horizontally across multiple servers , create file chunks.
thanks in advance!
do need store files in lucene index? adds overhead , slows down everything.
simply store these files in file system , have path reference in lucene document (e.g. /path/to/file).
indexing of contents should fine though, providing have adequate amount of ram available.
Comments
Post a Comment