Hi everybody and happy new year to all of you.
I am asking for your help because I am really not getting anywhere with my project. So you guys are my last hope.
I am a newbie to data mining and I am doing my best to learn the basics so that I get started with my project.
I guess my problem, is the fact that I am new to data mining and I have no clue how to get started. The approach I have been following is trying to read books about data mining and machine learning so that I can understand and compare all the "numerous" algorithms out there and then try to find to the one(s) that would apply to my case.
The first problem with this approach is that I did not find any good resources (either the subjects are treated on the surface or they are overly made complex and hard to follow through). The second problem is that it is incredibly time consuming. So I am wondering if I should continue in this path or if I should try to proceed differently.
I am sure that a lot of you guys have been in the same position and some of you have struggled with this problem just like me. So, I am hopping that you guys would suggest a method that would help me get started.
The project I am working on is related to the field of agriculture and has as objective to try to find the best values of all the parameters that affect the outcome (the amount of meat produced) of an animal production (could be dairy, poultry, porch, etc...)
So as I said, the approach is to run one or more algorithms on historical data for a certain type of production (poultry for example) and trying to find what should be the best values for the operating conditions that would maximize the growth of the animals (weight), while trying to minimize the production costs. A few examples of the questions that this project is trying to solve are as follows: when is the best time and how long should the barns be light? When and how much food should we give the animals? What is the best operating temperature set point? When and how much cooling/heating should be done? , etc....
As you noticed, all these questions are concerned with the optimization of the operating conditions but most importantly, the reduction of operating costs. Huge amounts (10's of Go) of historical data for these operating conditions are to be used for this purpose.
PS: I am trying to use the Weka learning environment (java based and open-source).
I hope that you guys would kind enough to help me work my way trough this. I would appreciate your help and advice and I thank in advance all of you who took the time to read this lengthy post
Cheers.
I am asking for your help because I am really not getting anywhere with my project. So you guys are my last hope.
I am a newbie to data mining and I am doing my best to learn the basics so that I get started with my project.
I guess my problem, is the fact that I am new to data mining and I have no clue how to get started. The approach I have been following is trying to read books about data mining and machine learning so that I can understand and compare all the "numerous" algorithms out there and then try to find to the one(s) that would apply to my case.
The first problem with this approach is that I did not find any good resources (either the subjects are treated on the surface or they are overly made complex and hard to follow through). The second problem is that it is incredibly time consuming. So I am wondering if I should continue in this path or if I should try to proceed differently.
I am sure that a lot of you guys have been in the same position and some of you have struggled with this problem just like me. So, I am hopping that you guys would suggest a method that would help me get started.
The project I am working on is related to the field of agriculture and has as objective to try to find the best values of all the parameters that affect the outcome (the amount of meat produced) of an animal production (could be dairy, poultry, porch, etc...)
So as I said, the approach is to run one or more algorithms on historical data for a certain type of production (poultry for example) and trying to find what should be the best values for the operating conditions that would maximize the growth of the animals (weight), while trying to minimize the production costs. A few examples of the questions that this project is trying to solve are as follows: when is the best time and how long should the barns be light? When and how much food should we give the animals? What is the best operating temperature set point? When and how much cooling/heating should be done? , etc....
As you noticed, all these questions are concerned with the optimization of the operating conditions but most importantly, the reduction of operating costs. Huge amounts (10's of Go) of historical data for these operating conditions are to be used for this purpose.
PS: I am trying to use the Weka learning environment (java based and open-source).
I hope that you guys would kind enough to help me work my way trough this. I would appreciate your help and advice and I thank in advance all of you who took the time to read this lengthy post
Cheers.