1 fontfile arialuni.ttf 2 fontname Arial Unicode MS 3 database encoding utf-8 4 database communication encoding utf-8 5 content encoding utf-8 6 browser version 1.0 7 build release Medusa-1.4-B774 8 build date 2011-11-15 9 build user wortschatz 10 build location aspra5:/disk1/users/wortschatz/WSToolchain/tools/Medusa-1.4-B774 11 build file encoding UTF-8 12 build system Linux on amd64 13 build architecture model 64 14 build java version 1.6.0_15 15 release date 04-März-2008 10:09 AM 16 release user mbuechler 17 corpus file /disk1/users/wortschatz/WSToolchain/workingDir/por_br_newscrawl_2011_1M/por_br_newscrawl_2011_1M.work_file 18 used memory 5726666752 19 WORD_TOKENS 22206624 20 de.uni_leipzig.asv.medusa.filter.sidx.IDXNeighbourhoodFilterImpl.SIG_COOCCURRENCE_TOKENS 16032069 21 de.uni_leipzig.asv.medusa.filter.sidx.IDXNeighbourhoodFilterImpl.SIG_COOCCURRENCE_TYPES 537248 22 de.uni_leipzig.asv.medusa.filter.sidx.IDXNeighbourhoodFilterImpl.n 21206624 23 de.uni_leipzig.asv.medusa.filter.sidx.IDXSentenceFilterImpl.SIG_COOCCURRENCE_TYPES 3248106 24 de.uni_leipzig.asv.medusa.filter.sidx.IDXInvertedListFilterImpl.n 1000000 25 SENTENCES 1000000 26 de.uni_leipzig.asv.medusa.filter.sidx.IDXSentenceFilterImpl.n 1000000 27 BOW_WORD_TOKENS 20344382 28 de.uni_leipzig.asv.medusa.filter.sidx.IDXSentenceFilterImpl.SIG_COOCCURRENCE_TOKENS 185034926 29 de.uni_leipzig.asv.medusa.filter.sidx.IDXSentenceFilterImpl.COOCCURRENCE_TYPES 13174886 30 WORD_TYPES 336928 31 de.uni_leipzig.asv.medusa.filter.sidx.IDXInvertedSourceListFilterImpl.n 1000000 32 de.uni_leipzig.asv.medusa.filter.sidx.IDXNeighbourhoodFilterImpl.COOCCURRENCE_TYPES 728626 33 auto mwu detection enabled false 34 duration of executing de.uni_leipzig.asv.medusa.filter.sidx.IDXSentenceFilterImpl 0 hour(s) 5 min 18,175 sec 35 duration of exporting de.uni_leipzig.asv.medusa.filter.sidx.IDXSentenceFilterImpl 0 hour(s) 0 min 42,155 sec 36 duration of word numbers generation 0 hour(s) 4 min 14,665 sec 37 duration of executing de.uni_leipzig.asv.medusa.filter.sidx.IDXInvertedSourceListFilterImpl 0 hour(s) 0 min 0,739 sec 38 duration of executing de.uni_leipzig.asv.medusa.filter.sidx.IDXNeighbourhoodFilterImpl 0 hour(s) 1 min 22,536 sec 39 duration for generation wnc file 0 hour(s) 0 min 1,61 sec 40 duration of executing de.uni_leipzig.asv.medusa.filter.sidx.IDXInvertedListFilterImpl 0 hour(s) 2 min 21,433 sec 41 most frequent word , 42 duration of generation bow frequencies 0 hour(s) 0 min 4,877 sec 43 duration of wswn transformation 0 hour(s) 0 min 16,220 sec 44 duration of tokenisation 0 hour(s) 2 min 31,644 sec 45 duration of exporting de.uni_leipzig.asv.medusa.filter.sidx.IDXInvertedListFilterImpl 0 hour(s) 0 min 23,903 sec 46 most frequent word's frequency 1224572 47 duration of exporting de.uni_leipzig.asv.medusa.filter.sidx.IDXNeighbourhoodFilterImpl 0 hour(s) 0 min 3,925 sec