Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

  • Congratulations IamaSherpa on being selected by the Tek-Tips community for having the most helpful posts in the forums last week. Way to Go!

Normalize Column Data 3

Status
Not open for further replies.

mratx

Programmer
Sep 17, 2009
4
US
I have an old ugly system table that I need to normalize. It contains a pipe delimited part numbers list in one field and a corresponding node in another column.

Here is an example:
Code:
CREATE TABLE lookup(
   part_number VARCHAR2(100),
   node        NUMBER
);

INSERT 
  INTO lookup(part_number, node)
VALUES ('aaabbb|aaaccc|aaaddd|aaaeee', 100);

INSERT 
  INTO lookup(part_number, node)
VALUES ('aaafff', 200);

INSERT 
  INTO lookup(part_number, node)
VALUES ('aaaggg|aaahhh|aaaiii', 300);

SELECT * 
  FROM lookup;

Output:
part_number                      node
aaabbb|aaaccc|aaaddd|aaaeee	 100
aaafff	                         200
aaaggg|aaahhh|aaaiii	         300

My desired output from a single query would be:
Code:
part_number                      node
aaabbb                           100
aaaccc                           100
aaaddd                           100
aaaeee                           100
aaafff                           200
aaaggg                           300
aaahhh                           300
aaaiii                           300

I can parse a single row like I want if I use regexp_substr and connect by like this:
Code:
 SELECT regexp_substr(part_number, regex, 1, LEVEL), node
   FROM (SELECT part_number, node, '[^|]+' regex 
           FROM lookup
          WHERE node = 100)
CONNECT BY LEVEL <= LENGTH(REGEXP_REPLACE(part_number, regex)) + 1;

Output:
aaabbb	100
aaaccc	100
aaaddd	100
aaaeee	100

But, if I remove the node = 100 condition, then the results are wrong. I know I could parse through this easily using PL/SQL, but I'd like to see if it's possible with pure SQL first. In case it's not obvious, I'm on Oracle 11g. Thanks for the replies!
 
This is a bit clunky and there are probably better ways but at least its a start.

SQL> column part_number format a40
SQL> select * from lookup;

PART_NUMBER NODE
---------------------------------------- ----------
aaabbb|aaaccc|aaaddd|aaaeee 100
aaafff 200
aaaggg|aaahhh|aaaiii 300

1 WITH T AS
2 (SELECT node,part_number c
3 FROM lookup),
4 t2 AS
5 (SELECT node,'|'||c||'|' c FROM T)
6 SELECT distinct node,SUBSTR(c, INSTR(c,'|',1,LEVEL)+1,
7 INSTR(c, '|', 1, LEVEL+1)- INSTR (c, '|', 1, LEVEL)- 1) "String"
8 FROM t2
9 CONNECT BY LEVEL < LENGTH(c) - LENGTH(REPLACE(c,'|'))
10* order by node
SQL> /

NODE String
---------- --------------------
100 aaabbb
100 aaaccc
100 aaaddd
100 aaaeee
200 aaafff
300 aaaggg
300 aaahhh
300 aaaiii

8 rows selected.



In order to understand recursion, you must first understand recursion.
 
Thanks for the reply. This is essentially what my regexp does, except you added a DISTINCT to get rid of the duplicates. I appreciate seeing another way to do this, but performance wise, it's just as bad as the regexp version. I guess what I need is a query that won't create duplicates to begin with, and therefore won't require a costly DISTINCT clause to clean it up. Any ideas on that? Thanks again.
 
See if this performs better

SQL> l
1 with t as
2 (
3 select node, part_number str from lookup
4 )
5 select trim(x.column_value.extract('e/text()')) cols ,node
6 from t t, table (xmlsequence(xmltype('<e><e>' ||
7* replace(t.str,'|','</e><e>')|| '</e></e>').extract('e/e'))) x
SQL> /

COLS NODE
-------------------- ----------
aaabbb 100
aaaccc 100
aaaddd 100
aaaeee 100
aaafff 200
aaaggg 300
aaahhh 300
aaaiii 300



In order to understand recursion, you must first understand recursion.
 
Wow! Blazing fast! This is night and day compared the what I had, and I would give you 5 stars if I could! Thanks so much for your help!
 
Glad to be of help, have a star yourself for the simple fact of giving us table creation and insert statements in your OP. This makes investigating and answering questions soooooooooooo much easier. Others take note [2thumbsup]


In order to understand recursion, you must first understand recursion.
 
Status
Not open for further replies.

Part and Inventory Search

Sponsor

Back
Top