Outlier detection taking lot of time and not giving any results
shubham_samant
MemberPosts:2Newbie
I am using Automodel feature in Rapidminer educational license version. My processor is 6 core, i7-8750h with 16 GB ram.The dataset have more than 30000 rows and 12 columns . It have numeric and text data both. I tried running using the best features(as detected by Auto model) and am using the distance based outlier method. The model ran for more than 18 hours but still processing.
I have installed the outlier detection extension too still it didn't help.
How can I solve this issue? Is it because the educational version uses only 1 core?
Kindly help
I have installed the outlier detection extension too still it didn't help.
How can I solve this issue? Is it because the educational version uses only 1 core?
Kindly help
Tagged:
0
Answers
Our data set have more than 100000 records , I reduced the sample size to 30000 if I further reduce the data set to say like 3000 then the sample representation is too small for model training.
I have tried running the full data set in Python applying 2-3 different algorithms and its giving me the results successfully. When I run outlier detection models on python it do not give out of memory issue but in Rapid Miner with relatively smaller data set too it goes out of memory. Why is Automodel –Outlier Detection failing on relatively mid size data sets?