Hi!
I need your help extracting specific strings from a Word document.
One of my colleagues typed math formulas (thousands of them) into MS Word, and I need to be able to work with them in Excel.
They look like this: MO_SAL_EXP=1_62112_10_10+1_62120_10_10+1_62130_10_10
What we call the "complex criteria" (here MO_SAL_EXP) are always made of capital letters and underscores, but their length varies, as does the number of underscores.
What we call the "basic criteria" are always structured this way: "N_NNNNN_NN_NN" (N being a digit). The number of basic criteria in a formula may vary, and all types of calculations are possible: +, -, *, / with or without ().
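To make the structure above concrete, here is a minimal Python sketch that splits one formula into its parts. The name `tokenize` and the pattern `TOKEN` are made up for illustration. It assumes a complex criterion starts and ends with a capital letter, which the description above does not state explicitly.

```python
import re

# Hypothetical token pattern, tried in this order:
#   basic criterion   N_NNNNN_NN_NN   (1, 5, 2 and 2 digits)
#   complex criterion capital letters separated by single underscores
#                     (assumption: it starts and ends with a letter)
#   one operator, parenthesis or equals sign
TOKEN = re.compile(r"\d_\d{5}_\d{2}_\d{2}|[A-Z]+(?:_[A-Z]+)*|[-+*/()=]")


def tokenize(formula: str) -> list[str]:
    """Return the criteria and operators of a formula, in order."""
    return TOKEN.findall(formula)


tokens = tokenize("MO_SAL_EXP=(1_62112_10_10+1_62120_10_10)*1_62130_10_10")
print(tokens)
```

Anything that fits none of the three alternatives (spaces, lowercase words) is simply skipped, so this is only meant for strings that are already known to be formulas.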
Another big issue for me: there is text embedded between some formulas (for example, "MO_SAL_EXP=1_62112_10_10+1_62120_10_10+1_62130_10_10 should be positive while MB_PROD_NI_VEG should be negative").
The result I want is a list of all formulas and/or criteria. If the list is just text, that's OK. So, basically, I want to get rid of the polluting text between the formulas contained in my Word documents.
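As a rough illustration of the result I am after, here is a Python sketch using a regular expression. It assumes the document text has already been saved as plain text, and the names `FORMULA` and `extract_formulas` are made up for the example. It also assumes a complex criterion contains at least one underscore, so that ordinary capitalised words in the surrounding text are not picked up. Parentheses are accepted but not checked for balance.

```python
import re

BASIC = r"\d_\d{5}_\d{2}_\d{2}"      # N_NNNNN_NN_NN
COMPLEX = r"[A-Z]+(?:_[A-Z]+)+"      # assumption: at least one underscore
OPERAND = rf"(?:{BASIC}|{COMPLEX})"
TERM = rf"\(*{OPERAND}\)*"           # operand with optional parentheses
EXPRESSION = rf"{TERM}(?:\s*[-+*/]\s*{TERM})*"

# Optional "NAME=" prefix followed by an expression; the lookarounds
# stop a match from starting or ending in the middle of a longer word.
FORMULA = re.compile(rf"(?<!\w)(?:{COMPLEX}\s*=\s*)?{EXPRESSION}(?!\w)")


def extract_formulas(text: str) -> list[str]:
    """Return every formula or stand-alone criterion found in text."""
    # Remove whitespace inside each match so "A_B = ..." becomes "A_B=...".
    return [re.sub(r"\s+", "", m.group(0)) for m in FORMULA.finditer(text)]


sample = (
    "MO_SAL_EXP=1_62112_10_10+1_62120_10_10+1_62130_10_10 should be "
    "positive while MB_PROD_NI_VEG should be negative"
)
for item in extract_formulas(sample):
    print(item)
```

On the sample sentence this gives two items: the full MO_SAL_EXP formula and the stand-alone MB_PROD_NI_VEG. Writing one item per line to a text file would be enough for Excel to open it as a single column.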
I know it is tricky and uncommon, but if you have any idea, I would very much appreciate your help.
Thanks!