search - Lucene.net with large files -


i wondering if possible index , store number of large files each on 4gb in size. had no problems doing , searching documents 1 exception - not able retrieve , highlight content of matched documents. code below allows me create searchable index without running out of memory.

var doc = new document(); doc.add(new field(string, textreader)); 

changing line below result in out of memory exception.

new field(string, textreader.readtoend(), field.store.yes, field.index.analyzed) 

i able index , store 28 files 150mb each allowed me search , retrieve matched text. however, query performance unacceptable , after 2 or 3 searches out of memory exception thrown. understand reason exception , why occurs. question community missing something? there functionality within lucene api addresses problem? have solution splits files , achieve without having scale application horizontally across multiple servers , create file chunks.

thanks in advance!

do need store files in lucene index? adds overhead , slows down everything.

simply store these files in file system , have path reference in lucene document (e.g. /path/to/file).

indexing of contents should fine though, providing have adequate amount of ram available.


Comments

Popular posts from this blog

python - How to create a legend for 3D bar in matplotlib? -

java - Multi-Label Document Classification -

php - Dynamic url re-writing using htaccess -