Hi,
Does anyone have a pointer on where I should start looking to do the following?
I have a database with a large number of addresses, but they are in free-text form, e.g.: Thomas Eddison, 231st E Street, California, 94201, USA.
However, they are not "conforming". For example, some use E and some use East; some use Street and some St.; California can be CA; USA can be United States of America; and so on.
From the free text, I need to extract the Name, Street, City, State, Zipcode & Country.
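To make the problem concrete, here is a rough sketch of the naive approach I can imagine, in Python. Everything in it is an assumption for illustration: the lookup tables (STREET_WORDS, STATES, COUNTRIES) and the function parse_address are hypothetical names, the tables hold only the variants mentioned above, and it only handles the five comma-separated fields of my sample line (which has no city). The real data is messier than this.

```python
import re

# Hypothetical lookup tables: only the variants from the example above.
# A real table would need far more entries (and "St" can also mean "Saint").
STREET_WORDS = {"e": "East", "east": "East",
                "st": "Street", "st.": "Street", "street": "Street"}
STATES = {"ca": "CA", "california": "CA"}
COUNTRIES = {"usa": "USA", "united states of america": "USA"}


def parse_address(text):
    """Split one free-text address on commas and normalize each field.

    Assumes exactly five fields in the order
    name, street, state, zipcode, country. Returns None otherwise.
    """
    parts = [p.strip() for p in text.split(",")]
    if len(parts) != 5:
        return None
    name, street, state, zipcode, country = parts

    # Replace whole words only, so "231st" is left alone.
    street = " ".join(STREET_WORDS.get(w.lower(), w) for w in street.split())

    # Accept 5-digit ZIP codes, optionally followed by a 4-digit extension.
    if not re.fullmatch(r"\d{5}(-\d{4})?", zipcode):
        return None

    return {
        "name": name,
        "street": street,
        "city": None,  # the sample line has no city field
        "state": STATES.get(state.lower(), state),
        "zipcode": zipcode,
        "country": COUNTRIES.get(country.lower(), country),
    }


if __name__ == "__main__":
    print(parse_address("Thomas Eddison, 231st E Street, California, 94201, USA"))
    print(parse_address("Thomas Eddison, 231st East St., CA, 94201, United States of America"))
```

Both sample lines should come out as the same record, which is the kind of result I am after. What I do not know is how to make this robust when fields are missing, reordered, or not separated by commas.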
Has anyone done this kind of extraction before? I understand that it is complicated, as it also needs data cleansing. Any pointers on how I can achieve this?
Sharing of previous experience will be greatly appreciated. Thanks a million in advance.
~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-