1. fontfile: arialuni.ttf
2. fontname: Arial Unicode MS
3. database encoding: utf-8
4. database communication encoding: utf-8
5. content encoding: utf-8
6. browser version: 1.0
7. build release: Medusa-1.4-B774
8. build date: 2011-11-15
9. build user: wortschatz
10. build location: aspra5:/disk1/users/wortschatz/WSToolchain/tools/Medusa-1.4-B774
11. build file encoding: UTF-8
12. build system: Linux on amd64
13. build architecture model: 64
14. build java version: 1.6.0_15
15. release date: 04-März-2008 10:09 AM
16. release user: mbuechler
17. corpus file: /disk1/users/wortschatz/WSToolchain/workingDir/por_br_newscrawl_2011_100K/por_br_newscrawl_2011_100K.work_file
18. used memory: 5726666752
19. WORD_TOKENS: 2218161
20. de.uni_leipzig.asv.medusa.filter.sidx.IDXNeighbourhoodFilterImpl.SIG_COOCCURRENCE_TOKENS: 1287042
21. de.uni_leipzig.asv.medusa.filter.sidx.IDXNeighbourhoodFilterImpl.SIG_COOCCURRENCE_TYPES: 84776
22. de.uni_leipzig.asv.medusa.filter.sidx.IDXNeighbourhoodFilterImpl.n: 2118161
23. de.uni_leipzig.asv.medusa.filter.sidx.IDXSentenceFilterImpl.SIG_COOCCURRENCE_TYPES: 332200
24. de.uni_leipzig.asv.medusa.filter.sidx.IDXInvertedListFilterImpl.n: 100000
25. SENTENCES: 100000
26. de.uni_leipzig.asv.medusa.filter.sidx.IDXSentenceFilterImpl.n: 100000
27. BOW_WORD_TOKENS: 2032306
28. de.uni_leipzig.asv.medusa.filter.sidx.IDXSentenceFilterImpl.SIG_COOCCURRENCE_TOKENS: 11831264
29. de.uni_leipzig.asv.medusa.filter.sidx.IDXSentenceFilterImpl.COOCCURRENCE_TYPES: 1721106
30. WORD_TYPES: 100123
31. de.uni_leipzig.asv.medusa.filter.sidx.IDXInvertedSourceListFilterImpl.n: 100000
32. de.uni_leipzig.asv.medusa.filter.sidx.IDXNeighbourhoodFilterImpl.COOCCURRENCE_TYPES: 107926
33. auto mwu detection enabled: false
34. duration of executing de.uni_leipzig.asv.medusa.filter.sidx.IDXSentenceFilterImpl: 0 hour(s) 1 min 46,57 sec
35. duration of exporting de.uni_leipzig.asv.medusa.filter.sidx.IDXSentenceFilterImpl: 0 hour(s) 0 min 5,946 sec
36. duration of word numbers generation: 0 hour(s) 0 min 46,704 sec
37. duration of executing de.uni_leipzig.asv.medusa.filter.sidx.IDXInvertedSourceListFilterImpl: 0 hour(s) 0 min 0,76 sec
38. duration of executing de.uni_leipzig.asv.medusa.filter.sidx.IDXNeighbourhoodFilterImpl: 0 hour(s) 0 min 53,101 sec
39. duration for generation wnc file: 0 hour(s) 0 min 0,308 sec
40. duration of executing de.uni_leipzig.asv.medusa.filter.sidx.IDXInvertedListFilterImpl: 0 hour(s) 0 min 57,318 sec
41. most frequent word: ,
42. duration of generation bow frequencies: 0 hour(s) 0 min 1,163 sec
43. duration of wswn transformation: 0 hour(s) 0 min 20,398 sec
44. duration of tokenisation: 0 hour(s) 0 min 17,591 sec
45. duration of exporting de.uni_leipzig.asv.medusa.filter.sidx.IDXInvertedListFilterImpl: 0 hour(s) 0 min 2,627 sec
46. most frequent word's frequency: 122048
47. duration of exporting de.uni_leipzig.asv.medusa.filter.sidx.IDXNeighbourhoodFilterImpl: 0 hour(s) 0 min 0,895 sec
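
The duration entries above use the pattern "H hour(s) M min S sec", and the seconds field appears to be written with a decimal comma (for example "46,57"). The following is a minimal Python sketch for converting such a value to seconds so that timings can be compared or summed. The function name `duration_to_seconds` is hypothetical and not part of Medusa, and reading the comma as a decimal separator is an assumption based only on the values shown here.

```python
import re

# Assumed pattern, inferred from the listing: "<h> hour(s) <m> min <s> sec".
# The seconds field may carry a decimal comma, e.g. "46,57".
_DURATION = re.compile(r"(\d+) hour\(s\) (\d+) min (\d+(?:,\d+)?) sec")


def duration_to_seconds(text: str) -> float:
    """Convert one duration value from the listing into seconds."""
    match = _DURATION.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised duration: {text!r}")
    hours, minutes, seconds = match.groups()
    # Assumption: the comma is a decimal separator, so swap it for a dot.
    return int(hours) * 3600 + int(minutes) * 60 + float(seconds.replace(",", "."))


if __name__ == "__main__":
    # Value copied from the IDXSentenceFilterImpl execution entry.
    print(duration_to_seconds("0 hour(s) 1 min 46,57 sec"))
```

Anchoring the match with `fullmatch` makes the function reject any value that does not follow the assumed pattern, so an unexpected format is reported as an error and never silently misread.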