"How to speed up logistic regression?"

UsernameUsername MemberPosts:39Guru
edited May 2019 inHelp
Hi,

the logistic regression operator has several parameters. I'm currently using it with the default values. I have ~150 attributes and ~300 000 examples. What parameters can speed up the learning process without decreasing the regression quality too much?

Thanks
Tagged:

Answers

  • stereotaxonstereotaxon MemberPosts:10Contributor II
    Not to disrespect RapidMiner, but if you might want to try R for the analysis.

    make sure you have lots of memory, and fast memory at that.

    -Mike
  • TobiasMalbrechtTobiasMalbrecht Moderator, Employee, MemberPosts:292RM Product Management
    Hi,

    since logistic regression uses an evoluationary approach to find an optimal solution you can reduce the runtime by specifying smaller values for the parameters [tt]max_generations[/tt] or [tt]population_size[/tt]. However you have to be aware, that it is more unlikely to find an optimal solution the more you put constraints on the parameters.

    Besides, there are to Weka operators that use other model fitting approaches. Maybe you might want to try these, so you do not have to leave RapidMiner completely!;)

    Regards,
    Tobias
  • UsernameUsername MemberPosts:39Guru
    The W-Logistic operator from WEKA is indeed much faster:).
  • IngoRMIngoRM Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University ProfessorPosts:1,751RM Founder
    Hi,

    and I want to give a hint to our Kernel logistic regression approach "MyKLRLearner" which in my experience is again faster;)

    Cheers,
    Ingo
Sign InorRegisterto comment.