Hi there,
I tried running the programme with 100k segments in-domain, and 100k segments mix-domain dedicating 2G RAM memory in the command line and I received out of memory exceptions. However, when I ran the same command on 50k in-domain and 100k mix-domain, there was no exceptions and code ran fine.
I was wondering if you have tested to find the optimum values for in and mix domain number of segments, and also if is there a way to increase this limitation to be able to deal with much larger corpora. For instance, millions of segments both in and mix domain.
Regards
Hi there,
I tried running the programme with 100k segments in-domain, and 100k segments mix-domain dedicating 2G RAM memory in the command line and I received out of memory exceptions. However, when I ran the same command on 50k in-domain and 100k mix-domain, there was no exceptions and code ran fine.
I was wondering if you have tested to find the optimum values for in and mix domain number of segments, and also if is there a way to increase this limitation to be able to deal with much larger corpora. For instance, millions of segments both in and mix domain.
Regards