Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

  • Congratulations SkipVought on being selected by the Tek-Tips community for having the most helpful posts in the forums last week. Way to Go!

How to create info based on an unknown imported dataset (no clue of # and kind of variables)?

Status
Not open for further replies.

Duckman321

Programmer
Nov 23, 2013
1
0
0
NL
Hi,

I'm facing the following 'challenge': based on *any* imported dataset with an unknown number / set of variables (by proc import, or just simply dragged into Enterprise Guide) I want to create the following information among other things (as a report or output in a dataset):
1) The number of records filled and/or blanc (OBC_ID=_n_ creates a copy of obs, always filled, so that should be an easy step in the process)
2) Per variable (presented by variable name), the total sum of all records (in case of numeric field)
3) Per variable (presented by variable name), the ratio filled records vs. blanc records, so the % of the variable name that is blanc (for numeric and character variables)

The dataset vcolumn in SASHELP contains all variables of the (random) imported set, but not the info as needed unfortunately. For this assignment, it looks like I need to create a loop in which all unknown variables are assigned to a macro, and from there the code should read out all records of the variable, so the output can be generated? Should that be a good approach? If so, how can this be done?

For example, the output has to provide me with information as presented below (all variables of the imported dataset which contains unknown variables should be present, so the dummy dataset as presented as the desired output below does contain 7 variables):

***************************
Output, based on a random imported set:

Variable name | number of records in total dataset | total sum of records | % blanc (or % filled)
Video_ID | 1600 | 1280800 | 100%
Video_ID2 | 800 | 158766 | 50%
Name | 1600 | N/A (Char) | 100%
Address | 400 | N/A (Char) | 25%
Amount | 1600 | 5500 | 100%
Date_rent | 1600 | N/A (output=sum dates) | 100%
Date_due | 1600 | N/A (output=sum dates) | 100%

How can this be done...? There's no dataset in SASHELP with such info, if I'm correct.

Thanks for your input!!
 
Status
Not open for further replies.

Part and Inventory Search

Sponsor

Back
Top