Wednesday, April 28, 2010
Thursday, April 22, 2010
Read CSV file with carriage return characters
/* double quotes with a specified character. This sample */
/* uses '@' and '$', but any character can be used, including */
/* spaces. CR/LFs not in double quotes will not be replaced. */
%let repA=' '; /* replacement character LF */
%let repD=' '; /* replacement character CR */
%let dsnnme="/home/help/zzz.csv"; /* use full path of CSV file */
data _null_;
/* RECFM=N reads the file in binary format. The file consists */
/* of a stream of bytes with no record boundaries. SHAREBUFFERS */
/* specifies that the FILE statement and the INFILE statement */
/* share the same buffer. */
infile &dsnnme recfm=n sharebuffers;
file &dsnnme recfm=n;
/* OPEN is a flag variable used to determine if the CR/LF is within */
/* double quotes or not. Retain this value. */
retain open 0;
input a $char1.;
/* If the character is a double quote, set OPEN to its opposite value. */
if a = '"' then open = ^(open);
/* If the CR or LF is after an open double quote, replace the byte with */
/* the appropriate value. */
if open then do;
if a = '0D'x then put &repD;
else if a = '0A'x then put &repA;
end;
run;
filename infle '/home/help/zzz.csv' termstr=CRLF;
PROC IMPORT OUT= zzz DATAFILE= infle
DBMS=CSV REPLACE;
GUESSINGROWS=1000;
GETNAMES=YES;
DATAROW=2;
RUN;
Monday, April 19, 2010
MSGLEVEL option
The default is MSGLEVEL=N.
OPTIONS MSGLEVEL=I;
See also
Wednesday, April 14, 2010
Read all files in one directory
%macro readraw(dir=.);
%local fileref rc did dnum dmem memname;
%let fileref=thisdir;
%let rc=%sysfunc(filename(fileref,&dir));
%let did=%sysfunc(dopen(&fileref));
%let dnum=%sysfunc(dnum(&did));
%do dmem=1 %to &dnum;
%let memname=%sysfunc(dread(&did, &dmem));
%if %upcase(%scan(&memname,-1,.)) = DAT %then %do;
%let dataset=%scan(&memname,1,.);
data &dataset;
infile "&dir\&memname";
input Course_code $4. Location $15. Begin_date date9.;
format Begin_date date9.;
run;
proc print data=&dataset;
title "%trim(&syslast)";
run;
%end;
%end;
%let rc=%sysfunc(dclose(&did));
%let rc=%sysfunc(filename(fileref));
%mend readraw;
options mprint;
%readraw(dir=c:\workshop\winsas\amacr);
Monday, April 5, 2010
Friday, March 19, 2010
Tuesday, March 16, 2010
Monday, March 15, 2010
RSASUSER System Option: OpenVMS
“WARNING: Unable to copy SASUSER registry to WORK registry. Because of this, you will not see registry customizations during this session.”
Link: RSASUSER
Thursday, March 11, 2010
Proc Import trick
guessingrows = 1000;
specifies the number of rows of the file to scan in order to determine the appropriate data type and length for the columns. The scan data process scans from row 1 to the number that is specified by GUESSINGROWS=. The default value is 20. However, you can change the default value in the SAS Registry under SAS REGISTRY -> PRODUCTS -> BASE -> EFI -> GuessingRows.
| Default: | 20 |
| Requirement: | This number should be greater than the value that is specified for DATAROW=. |
Monday, March 8, 2010
Sunday, February 28, 2010
Wednesday, February 24, 2010
Tuesday, February 23, 2010
ANY* and NOT* Function
data test;
do dec=0 to 255;
byte=byte(dec);
hex=put(dec,hex2.);
anycntrl=anycntrl(byte);
anygraph=anygraph(byte);
anypunct=anypunct(byte);
notalpha=notalpha(byte);
notprint=notprint(byte);
output;
end;
run;
proc print data=test;
run;
Sunday, February 21, 2010
PROC SURVEYSELECT
Select a random sample where no observation can be chosen more than once.
/* WORK.EASTHIGH is a data base of student grade point averages */
/* from East High School, Grades 9 through 12, 100 or more students per grade. */
data EastHigh;
format GPA 3.1;
do Grade=9 to 12;
do StudentID=1 to 100+int(201*ranuni(432098));
GPA=2.0 + (2.1*ranuni(34280));
output;
end;
end;
run;
/* Using PROC SURVEYSELECT */
/* Use METHOD=SRS. N= is the number of observations to select. */
/* The sample is stored in the OUT= data set, SAMPLE1. */
proc surveyselect data=EastHigh method=srs n=15 out=sample1;
run;
title "PROC SURVEYSELECT";
proc print data=sample1;
run;
Simple random sample with replacement
Select a random sample where an observation can be chosen more than once.
/* Using PROC SURVEYSELECT */
/* Use METHOD=URS. N= is the number of observations to select. */
/* The sample is stored in the OUT= data set, SAMPLE. */
/* The OUTHITS option includes an observation in the OUT= data set */
/* for each selected unit. By default, the OUT= will contain one */
/* observation for each unique selected unit and the NumberHits */
/* variable identifies the number of times each unit is selected. */
proc surveyselect data=EastHigh method=urs n=15 out=sample outhits;
run;
title "PROC SURVEYSELECT ";
proc print;
run;
Saturday, February 20, 2010
Wednesday, February 10, 2010
Look up system options and display setting
Superscript, Subscript in RFT
length secnam $15;input sortord secnam $ pvalue;
cards;
1 demog 0.8812
2 ae 0.7112
3 disposition 0.8112
4 medicalhistory 0.9112
;
run;
ods listing close;
ods rtf file="Test.rtf" style=rtfout;
ods escapechar='\'; *** value not limited to \;
proc report data = test missing split="$" spacing=0 headline nowd;
column sortord secnam pvalue;
define sortord / order noprint;
define secnam / order flow "Demographics$Variable\{super a}";
define pvalue / display flow "ANOVA$P-Value\{sub p}";
footnote1 "testing\{super 1}";
footnote2 "new test\{sub 2}";
run;
ods rtf close;
ods trace off;
ods listing;
Monday, February 8, 2010
Changing titles in ODS Content Page
ODS PROCLABEL "" ; SAS Qualification Tools
- The SAS IQ assists in demonstrating the SAS System has been installed and maintained to the manufacturer's specifications. SAS IQ verifies the integrity of each file in the SAS System 9 and provides the customer a set of reports detailing the results.
- The SAS OQ assists in demonstrating the SAS System is operational. SAS OQ uses SAS programs provided by the component development groups and will execute, process, and report the program results.
Thursday, February 4, 2010
Tuesday, February 2, 2010
Creating a Stored Macro Facility
libname macro '/home/dltmp/maclib';
%macro test / store des='This is my program';
%let x=1;
%mend;
%copy test/source;
proc catalog cat=macro.sasmacr;
contents;
run;
Link: http://www2.sas.com/proceedings/forum2008/101-2008.pdf
Handle special character when import data
Link: http://support.sas.com/kb/18/688.html
Sunday, January 31, 2010
Utilize SAS Audit Files to Keep Track of SAS Data Set Updates
Creating an audit trail file for a SAS data set is easily done via the DATASETS Procedure.
proc datasets library=ctemp;
audit orsales;
initiate;
run;
quit;
Executing the code results in a SAS audit file named orsales.sas7baud being built in the CTEMP SAS data library. When updates are made to the orsales SAS data set, entries are made to the attendant orsales SAS audit file.
proc contents data=ctemp.orsales(type=audit);
run;
proc print data=ctemp.orsales(type=audit);
run;
You can determine the variables available in your audit file or print the audit file observations using PROC CONTENTS and PROC PRINT, respectively, using the TYPE=AUDIT data set option.
Note that the Member Type in the Attributes section of the CONTENTS output will state "AUDIT", and that you will get a few additional audit-related variables. You can use other SAS procedures, such as PROC REPORT, on the SAS audit file, too.
Friday, January 29, 2010
Sunday, January 24, 2010
Jump start in PROC REPORT
First, run PROC REPORT against the data set of interest, including the LIST option to list the code in the log and the NOEXEC option to prevent SAS from actually running the code. For example:
proc report data = sashelp.class list noexec ;
run ;
This should appear in the log:
PROC REPORT DATA=SASHELP.CLASS LS=96 PS=55 SPLIT="/" CENTER ;
COLUMN Name Sex Age Height Weight;
DEFINE Name / DISPLAY FORMAT= $8. WIDTH=8 SPACING=2 LEFT "Name" ;
DEFINE Sex / DISPLAY FORMAT= $1. WIDTH=1 SPACING=2 LEFT "Sex" ;
DEFINE Age / SUM FORMAT= BEST9. WIDTH=9 SPACING=2 RIGHT "Age" ;
DEFINE Height / SUM FORMAT= BEST9. WIDTH=9 SPACING=2 RIGHT "Height" ;
DEFINE Weight / SUM FORMAT= BEST9. WIDTH=9 SPACING=2 RIGHT "Weight" ;
RUN;
Copy the code to the program editor, and make any needed changes. Suppose we want average heights and weights by sex and age. Just insert the NOWD option in the PROC statement (assuming that interactive windowing is not needed), comment out the reference to NAME in the COLUMN statement and the DEFINE statement for NAME, and change the first option in each of the remaining DEFINE statements:
PROC REPORT DATA=SASHELP.CLASS LS=96 PS=55 SPLIT="/" CENTER nowd ;
COLUMN /* Name */ Sex Age Height Weight;
* DEFINE Name / DISPLAY FORMAT= $8. WIDTH=8 SPACING=2 LEFT "Name" ;
DEFINE Sex / group FORMAT= $1. WIDTH=1 SPACING=2 LEFT "Sex" ;
DEFINE Age / group FORMAT= BEST9. WIDTH=9 SPACING=2 RIGHT "Age" ;
DEFINE Height / mean FORMAT= BEST9. WIDTH=9 SPACING=2 RIGHT "Height" ;
DEFINE Weight / mean FORMAT= BEST9. WIDTH=9 SPACING=2 RIGHT "Weight" ;
RUN;
The output:
S
e
x Age Height Weight
F 11 51.3 50.5
12 58.05 80.75
13 60.9 91
14 63.55 96.25
15 64.5 112.25
M 11 57.5 85
12 60.366667 103.5
13 62.5 84
14 66.25 107.5
15 66.75 122.5
16 72 150
Cosmetic refinements can be made by changing the WIDTH option for SEX and the FORMAT options for HEIGHT and WEIGHT.
Thursday, January 21, 2010
Use EXIST to check dataset
%MACRO D_EXIST(INPUT); %SYSFUNC(EXIST(&INPUT)) %MEND D_EXIST;
%IF %D_EXIST(SASHELP.CLASS) %THEN %DO;
Wednesday, January 20, 2010
Monday, January 18, 2010
Determining the number of observations in a SAS data set efficiently
- The name of the data set and its member type
- The date and time the data set was created
- The number of observations
- The number of variables
- The engine type
Here there are some examples:
/* Before SAS V9*/
data _NULL_;
if 0 then set sashelp.class nobs=n;
call symput('nrows',trim(left(put(n,8.))));
stop;
run;
%put nobs=&nrows;
/* SAS V9.x and higher */
data _NULL_;
if 0 then set sashelp.class nobs=n;
call symputx('nrows',n);
stop;
run;
%put nobs=&nrows;- The "if 0" is a a conditional statement designed to never be executed at run-time and works since the "set sashelp.class" gives the header information of the data set class to the compiler which sets up the pdv accordingly, but is skipped in the execution time because the if condition is false.
- Nobs is a SAS automatic variable which contains the number of records in the dataset named in the set statement. The code, nobs = n, places the value of nobs (the number of records in the sashlep.class dataset) in the variable n. [NOBS option limitations: 1."The number of observations includes those observations that are marked for deletion but are not yet deleted."2."For certain SAS views, SAS cannot determine the number of observations. In these cases, SAS sets the value of the NOBS= variable to the largest positive integer value that is available in your operating environment.]
- The STOP statement is used to prevent an endless loop because EOF is never reached.
data _null_;
call symput('AnyObs','0'); /* set to 0 for No */
set &data(obs=1);
call symput('AnyObs','1'); /* if got here there are obs so set to 1 for Yes */
run;The meta data can also be accessed by the macro language using DATA step functions. The ATTRN function is used with the NLOBS argument in the %OBSCNT macro shown below to retrieve the number of non-deleted observations from the meta data.
%macro obscnt(dsn); %local nobs dsnid; %let nobs=.; %* Open the data set of interest; %let dsnid = %sysfunc(open(&dsn)); %* If the open was successful get the; %* number of observations and CLOSE &dsn; %if &dsnid %then %do; %let nobs=%sysfunc(attrn(&dsnid,nlobs)); %let rc =%sysfunc(close(&dsnid)); %end; %else %do; %put Unable to open &dsn - %sysfunc(sysmsg()); %end; %* Return the number of observations; &nobs %mend obscnt;
Monday, January 11, 2010
Forcing the Concatenation of Data Sets
PROC APPEND BASE=master DATA=txn FORCE; RUN;
Friday, January 1, 2010
Systask Statement
For example, in your BIGPROGRAM.SAS, you could have the following code:
systask command "sas your_clever_program1.sas" taskname=program1;
systask command "sas your_clever_program2.sas" taskname=program2;
waitfor _all_ program1 program2;
/* Other SAS code that processes the data sets created by the two afore-mentioned programs */
Note that the _all_ option states that your program is to wait for both program1 _AND_ program2 to complete before resuming execution. As an alternative, you could have coded _any_, which would have your program resume the moment either program1 _OR_ program2 completed execution.
Link: http://support.sas.com/documentation/cdl/en/hostwin/61924/HTML/default/win-stmt-systask.htm
Link: http://www2.sas.com/proceedings/sugi30/021-30.pdf
Monday, December 28, 2009
Call missing
The CALL MISSING routine assigns a character missing value (a blank) to each character variable in the argument list. If the current length of the character variable equals the maximum length, the current length is not changed. Otherwise, the current length is set to 1.
You can mix character and numeric variables in the argument list.
prod='shoes';
invty=7498;
sales=23759;
call missing(prod,invty);
put prod= invty= sales=;
prod= invty=. sales=23759
Wednesday, December 23, 2009
XMRLMEM option
data _null_;
format amt comma20.;
amt = input(getoption('xmrlmem'),20.);
put amt=;
run;
The code shown above gives the total number of bytes of real memory available. (It does not count the operating system's virtual memory; only real memory). Divide this number by 1024 to get it into K's... or whatever to get it into Megs or Gigs.
XMRLMEM is an undocumented diagnostic option that can come in handy when considering allocating a bushel-full of buffers or considering doing some big-time hashing.
Link: http://www2.sas.com/proceedings/forum2007/271-2007.pdf
Saturday, December 19, 2009
Seperate data into EXCEL sheets
proc sql noprint;
select distinct age
into :idage1 - :idage99
from sashelp.class;
%let agecnt = &sqlobs;
quit;
%do i = 1 %to &agecnt;
PROC EXPORT DATA = sashelp.class(where=(age=&&idage&i))
OUTFILE="C:\temp\class_ages.xls"
DBMS= excel
REPLACE;
sheet = "Age_&&idage&i";
RUN;
%END;
%mend multisheet;
%multisheet
Friday, December 4, 2009
List Oracle tables in SAS
libname oralib ORACLE user=ctpmstat path="@xxxx" password = xxxxxxx;
proc freq data = oralib.all_tables;
tables owner * table_name/list;
run;
Wednesday, December 2, 2009
X Command in SAS
%macro find ;
X 'find /home/xxx -name "*.xpt" -exec ls -i1 {} \; | sort > /home/xxx/zzz.txt';
%if &sysrc ne 0 %then %do ;
endsas;
%end ;
%mend find ;
%find ;
Link: http://support.sas.com/documentation/cdl/en/hostwin/61924/HTML/default/exittemp.htm
Link: www.lexjansen.com/pharmasug/2005/posters/po13.pdf
Sunday, November 29, 2009
Tuesday, November 24, 2009
Proc CDISC to validate SDTM
PROC CDISC MODEL=SDTM;
SDTM SDTMVERSION= "3.1";
DOMAINDATA DATA=WORK.DM
DOMAIN=DM
CATEGORY=SPECIAL;
RUN;
** ‘Events’ class domain: AE **;
PROC CDISC MODEL=SDTM;
SDTM SDTMVERSION= "3.1";
DOMAINDATA DATA=WORK.AE
DOMAIN=AE
CATEGORY=EVENTS;
RUN;
** ‘Interventions’ class domain: CM **;
PROC CDISC MODEL=SDTM;
SDTM SDTMVERSION= "3.1";
DOMAINDATA DATA=WORK.CM
DOMAIN=CM
CATEGORY=INTERVENTIONS;
RUN;
** ‘Findings’ class domain: IE **;
PROC CDISC MODEL=SDTM;
SDTM SDTMVERSION= "3.1";
DOMAINDATA DATA=WORK.IE
DOMAIN=IE
CATEGORY=FINDINGS;
RUN;
Link: http://www.lexjansen.com/pharmasug/2008/rs/rs07.pdf
Link: http://support.sas.com/rnd/base/xmlengine/proccdisc/index.html
Monday, November 23, 2009
Startpage in ODS
controls page breaks.
NEVER specifies not to insert page breaks, even before graphics procedures.
CAUTION:
Each graph normally requires an entire page. The default behavior forces a new page after a graphics procedure. STARTPAGE=NEVER turns off that behavior, so specifying STARTPAGE= NEVER might cause graphics to overprint.
NO specifies that no new pages be inserted at the beginning of each procedure, or within certain procedures, even if new pages are requested by the procedure code. A new page will begin only when a page is filled or when you specify STARTPAGE=NOW.
CAUTION:
Each graph normally requires an entire page. The default behavior forces a new page after a graphics procedure, even if you use STARTPAGE=NO. STARTPAGE=NEVER turns off that behavior, so specifying STARTPAGE= NEVER might cause graphics to overprint.
Alias: OFF
Tip: When you specify STARTPAGE=NO, system titles and footnotes are still produced only at the top and bottom of each physical page, regardless of the setting of this option. Thus, some system titles and footnotes that you specify might not appear when this option is specified.
NOW forces the immediate insertion of a new page.
Tip: This option is useful primarily when the current value of the STARTPAGE= option is NO. Otherwise, each new procedure forces a new page automatically.
YES inserts a new page at the beginning of each procedure, and within certain procedures, as requested by the procedure code.
Alias: ON
Default: YES
Link: http://analytics.ncsu.edu/sesug/2007/DP05.pdf
Link: http://www.nesug.org/Proceedings/nesug06/io/io13.pdf
Formdlim option
'delimiting-character'
specifies in quotation marks a character written to delimit pages. Normally, the delimit character is null, as in this statement:
options formdlim='';
use space will eliminate the delimitor between pages in the listing:
options formdlim=' ';
Data Step Views
| Definition of a DATA Step View |
| Creating DATA Step Views |
In order to create a DATA step view, specify the VIEW= option after the final data set name in the DATA statement. The VIEW= option tells SAS to compile, but not to execute, the source program and to store the compiled code in the input DATA step view that is named in the option.
For example, the following statements create a DATA step view named DEPT.A:
libname dept 'SAS-library'; data dept.a / view=dept.a; ... more SAS statements ... run;Note that if the SAS view exists in a SAS library, and if you use the same member name to create a new view definition, then the old SAS view is overwritten.
Beginning with Version 8, DATA step views retain source statements. You can retrieve these statements using the DESCRIBE statement. The following example uses the DESCRIBE statement in a DATA step view in order to write a copy of the source code to the SAS log:
data view=inventory; describe; run;For more information on how to create SAS views and use the DESCRIBE statement, see the DATA statement in SAS Language Reference: Dictionary.
| What Can You Do with a DATA Step View? |
Using a DATA step view, you can do the following:
- directly process any file that can be read with an INPUT statement
- read other SAS data sets
- generate data without using any external data sources and without creating an intermediate SAS data file.
| Differences between DATA Step Views and Stored Compiled DATA Step Programs |
DATA step views and stored compiled DATA step programs differ in the following ways:
- a DATA step view is implicitly executed when it is referenced as an input data set by another DATA or PROC step. Its main purpose is to provide data, one record at a time, to the invoking procedure or DATA step.
- a stored compiled DATA step program is explicitly executed when it is specified by the PGM= option on a DATA statement. Its purpose is usually a more specific task, such as creating SAS data files, or originating a report.
| Restrictions and Requirements |
Global statements do not to apply to a DATA step view. Global statements such as the FILENAME, FOOTNOTE, LIBNAME, OPTIONS, and TITLE statements, even if included in the DATA step that created the SAS view, have no effect on the SAS view. If you do include global statements in your source program statements, SAS stores the DATA step view but not the global statements. When the view is referenced, actual execution can differ from the intended execution.
When a view is created, the labels for the variable that it returns are also created. If a DATA step view reads a data set that contains variable labels and a label is changed after the view is created, any procedure output will show the original labels. The view must be recompiled in order for the procedure output to reflect the new variable labels.
If a view uses filerefs or librefs, the fileref or libref that is used is the one that is defined at the time that the view is compiled. This means that if you change the file that is referenced in a fileref that the view uses, the new file is ignored by the view and the file that is referred to by the fileref at the time the view was compiled continues to be used.
| Performance Considerations |
- DATA step code executes each time that you use a DATA step view, which might add considerable system overhead. In addition, you run the risk of having your data change between steps. However, this also means that you get the most recent data available--that is, data when the view is executed as compared to data when the view was compiled.
- Depending on how many reads or passes on the data are required, processing overhead increases.
- When one sequential pass is requested, no data set is created. Compared to traditional methods of processing, making one pass improves performance by decreasing the number of input/output operations and elapsed time.
- When random access or multiple passes are requested, the SAS view must build a spill file that contains all generated observations so that subsequent passes can read the same data that was read by previous passes. In some instances, the view SPILL= data set option can reduce the size of a spill file.
- When one sequential pass is requested, no data set is created. Compared to traditional methods of processing, making one pass improves performance by decreasing the number of input/output operations and elapsed time.
| Example 1: Merging Data to Produce Reports |
If you want to merge data from multiple files but you do not need to create a file that contains the combined data, you can create a DATA step view of the combination for use in subsequent applications.
For example, the following statements define DATA step view MYV9LIB.QTR1, which merges the sales figures in the data file V9LR.CLOTHES with the sales figures in the data file V9LR.EQUIP. The data files are merged by date, and the value of the variable Total is computed for each date.
libname myv9lib 'SAS-library'; libname v9lr 'SAS-library'; data myv9lib.qtr1 / view=myv9lib.qtr1; merge v9lr.clothes v9lr.equip; by date; total = cl_v9lr + eq_v9lr; run;
The following PRINT procedure executes the view:
proc print data=myv9lib.qtr1; run;
| Example 2: Producing Additional Output Files |
The FILENAME and the LIBNAME statements are both global statements and must exist outside of the code that defines the SAS view, because SAS views cannot contain global statements.
Here are the contents of the external file STUDENT:
dutterono MAT 3 lyndenall MAT frisbee MAT 94 SCI 95 zymeco ART 96 dimette 94 mesipho SCI 55 merlbeest ART 97 scafernia 91 gilhoolie ART 303 misqualle ART 44 xylotone SCI 96
Here is the DATA step that produces the output files:
libname myv9lib 'SAS-library'; filename student 'external-file-specification'; 1 data myv9lib.class(keep=name major credits) myv9lib.problems(keep=code date) / view=myv9lib.class; 2 infile student; input name $ 1-10 major $ 12-14 credits 16-18; 3 select; when (name=' ' or major=' ' or credits=.) do code=01; date=datetime(); output myv9lib.problems; end; 4 when (0<90) do code=02; date=datetime(); output myv9lib.problems; end; 5 otherwise output myv9lib.class; end; run; 6
The following example shows how to print the files created previously. The MYV9LIB.CLASS contains the observations from STUDENT that were processed without errors. The data file MYV9LIB.PROBLEMS contains the observations that contain errors.
If the data frequently changes in the source data file STUDENT, there would be different effects on the returned values in the SAS view and the SAS data file:
- New records, if error free, that are added to the source data file STUDENT between the time you run the DATA step in the previous example and the time you execute PROC PRINT in the following example, will appear in the SAS view MYV9LIB.CLASS.
- On the other hand, if any new records, failing the error tests, were added to STUDENT, the new records would not show up in the SAS data file MYV9LIB.PROBLEM, until you run the DATA step again.
filename student 'external-file-specification'; libname myv9lib 'SAS-library'; 7 proc print data=myv9lib.class; run; 8 proc print data=myv9lib.problems; format date datetime18.; run; 9
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| Link: http://www2.sas.com/proceedings/sugi29/067-29.pdf |
Saturday, November 21, 2009
Wednesday, November 11, 2009
“??” FORMAT MODIFIER
The following excerpt is from SAS OnlineDoc documentation: ? or ??
The optional question mark (?) and double question mark (??) format modifiers suppress the printing of both the error messages and the input lines when invalid data values are read. The ? modifier suppresses the invalid data message. The ?? modifier also suppresses the invalid data message and, in addition, prevents the automatic variable _ERROR_ from being set to 1 when invalid data are read. Below is an example of using ?? to determine whether a variable contains non-numeric values or not:
data _null_;
x = “12345678”;
if (input(x, ?? 8.) eq .) then
put ‘non-numeric’;
else put ‘numeric’;
run;
Running SAS would return “Numeric” in the above example. If we used X=”123a5678”, SAS would return “Non-Numeric”. Note that the input format in the above example is “8.” So only the first 8 bytes of the character string are checked. Thus, X=123456789a would return “Numeric” as it would only be checking the first 8 bytes of the string.
Link: http://www.nesug.org/Proceedings/nesug01/at/at1013.pdf
Sunday, November 8, 2009
Change Position of Variables Using RETAIN
Input x y z;
1 2 3
;
Run;
Retain z y x;
Run;
COUNTW Function (SAS 9.2)
proc sort data=sashelp.zipcode(keep=statename) out=allnames nodupkey; by statename; run; data words; set allnames; Words=countw(statename," "); run; proc freq data=words; tables Words; title1 "Number of Words in U.S. State and Territory Names"; run;
SAS Log
3999 proc sort data=sashelp.zipcode(keep=statename) 4000 out=allnames nodupkey; 4001 by statename; 4002 run; NOTE: There were 41763 observations read from the data set SASHELP.ZIPCODE. NOTE: 41707 observations with duplicate key values were deleted. NOTE: The data set WORK.ALLNAMES has 56 observations and 1 variables. 4003 data words; 4004 set allnames; 4005 Words=countw(statename," "); 4006 run; NOTE: There were 56 observations read from the data set WORK.ALLNAMES. NOTE: The data set WORK.WORDS has 56 observations and 2 variables. 4007 proc freq data=words; 4008 tables Words; 4009 title1 "Number of Words in U.S. State and Territory Names"; 4010 run; NOTE: There were 56 observations read from the data set WORK.WORDS.
SAS Listing Output
Number of Words in U.S. State and Territory Names The FREQ Procedure Cumulative Cumulative Words Frequency Percent Frequency Percent ---------------------------------------------------------- 1 42 75.00 42 75.00 2 12 21.43 54 96.43 3 2 3.57 56 100.00
Thursday, November 5, 2009
Unix Commands
edit mode for temp.sas;
read only;
create directory called test;
remove directory test, test needs be emptied first;
remove directory test, test doesn't need to be emptied first;
go to the directory one level up;
go to directory called tmpdir;
copy temp.sas as temp.sas;
1). change delete/cut copy
__________________________________________
2).
move to line 6: :6 or 6G
Join 5 lines: 5J;
insert: i or a;
repeat: .;
:3,18m23 move lines 3 through 18 after line 23;
:20,.m$ move line 20 through current line ( . ) to the end ( $ );
:r /temp/data.sas copy file data.sas in directory /temp and paste from current cursor position;
hp45 -z1 temp.lst
cp /home/xxx/programs/subdir/*.* .
uncompress *
chmod 751 temp.sas: change read only mode to edit mode;
chmod 770 temp.sas: allow group members to access;
chmod g+s pgmsdir: allow directory access;
-memsize 256m -work /lv10/tmp temp.sas
14).
cal --> show calendar;
15). set up timer for auto run:
crontab -e
00 05 06 11 * /lv06/sas91/sas /home/xxxxx/programs/temp.sas
(close editor using :wq)
00 15 * * 1-5 /lv03/sas82/sas /home/dlxxx/somejob.sas
4 45 11 28 9 * /usr/local/sas/SASFoundation/9.3/sas /sasdata/ltrc/reports/temp.sas
16). Search for file with a specific name in a set of files (-name)
find . -name "*conf" -print
This command will search in the current directory and all sub directories for a file ending with conf.
Note: The -print option will print out the path of any file that is found with that name. In general -print will print out the path of any file that meets the find criteria.
17). Use ls to create a sas file
ls . > temp.sas
This command will create a sas file temp.sas, which includes all the directory names, file names under current directory. Replace directory name with . to create file for other directory.
18). Use diff to compare two sas files
diff program1.sas program2.sas
Thursday, October 22, 2009
Positioning the SUBSTR Function
When the function is on the right side of an assignment statement, the function returns the requested string.
MiddleInitial=substr(middlename,1,1);
But if you place the SUBSTR function on the left side of an assignment statement, the function is used to modify variable values.
substr(region,1,3)='NNW';
When the SUBSTR function modifies variable values, the right side of the assignment statement must specify the value to place into the variable. For example, to replace the fourth and fifth characters of a variable named Test with the value 92, you write the following assignment statement:
substr(test,4,2)='92';
Test Test
S7381K2 S7392K2
S7381K7 S7392K7
Tuesday, September 22, 2009
Resolve Function
%let last=Hanks;
data _null_;
name1='Mr '||"&first"||' '||"&last";
name2=resolve('Mr &first &last');
put name1;
put name2;
run;
105 data _null_;
106 name1=‘Mr ’||”&first”||’ ’||”&last”;
107 name2=resolve('Mr &first &last');
108 put name1;
109 put name2;
110 run;
Mr Steve Hanks
Mr Steve Hanks
Using %SYSFUNC Functions
%let nobs=%sysfunc(attrn(&dsid,nobs)) ;
%let dsid=%sysfunc(close(&dsid)) ;
%put The number of obs in datamart.customer is &nobs ;
81 %let dsid=%sysfunc(open(datamart.customer)) ;
82 %let nobs=%sysfunc(attrn(&dsid,nobs)) ;
83 %let dsid=%sysfunc(close(&dsid)) ;
84 %put The number of obs in datamart.customer is &nobs ;
The number of obs in datamart.customer is 6299497
Using %SYSFUNC For Report Date
proc print data=sashelp.class;
title "We can put a todays (%sysfunc(today(),date9.)) date in a report title";
run;
SYMGLOBL and SYMLOCAL
%let c=yes;
%macro test;
%let d=yes;
data _null_;
if symlocal ('c') then put '**** c is local';
if symglobl('c') then put '**** c is global';
if symlocal ('d') then put '**** d is local';
if symglobl('d') then put '**** d is global';
run;
%mend test;
%test;
**** c is global
**** d is local
NOTE: DATA statement used (Total process time):
real time 0.00 seconds
cpu time 0.00 seconds
Deleting a Macro Variable
%let a=yes;
data _null_;
if symexist('a') then put '**** a exists';
call symdel('a');
if symexist('a') then put '**** a still exists';
else put '**** a doesnt exist';
run;
**** a exists
**** a doesnt exist
NOTE: DATA statement used (Total process time):
real time 0.01 seconds
cpu time 0.01 seconds
SYMEXIST to check if a Macro Variable Exists
data _null_;
if symexist('a') then put '**** a exists';
Else put '**** a doesnt exist';
if symexist('b') then put '**** b exists';
Else put '**** b doesnt exist';
run;
**** a exists
**** b doesnt exist
NOTE: DATA statement used (Total process time):
real time 0.00 seconds
cpu time 0.00 seconds
Monday, September 7, 2009
Macro IN Operator (SAS 9.2)
%macro test;
%if &dsn IN ae ds co cm %then %do;
Some---SAS—Statements;
%end;
%test;
Note:
No need of % sign in front of IN operator.
In SAS 9.2 both IN and # work if you use the system option minoperator inside your macro call.
%macro test/minoperator;
%if &dsn # ae ds co cm %then %do;
Some---SAS—Statements;
%end;
%test;
%macro test;
%if &dsn IN ae ds co cm %then %do;
Some---SAS—Statements;
%end;
%test;
MINOPERATOR option tells SAS to recognize the word 'IN' or special symbol # by the SAS macro facility as an infix operator when evaluating logical or integer expressions.
Here is another way of writing the macro code with delimiters.
Use MINDELIMITER option to change the default delimiter from space to any other, in this case it is comma (,).
options mindelimiter;
%macro test/mindelimiter=',';
%if &dsn IN ae,ds,co,cm %then %do;
Some---SAS—Statements;
%end;
%test;
Sunday, September 6, 2009
CALL SYMPUTX / CALL SYMPUT
•CALL SYMPUTX does not write a note to the SAS log when the second argument is numeric. CALL SYMPUT, however, writes a note to the log stating that numeric values were converted to character values.
•CALL SYMPUTX uses a field width of up to 32 characters when it converts a numeric second argument to a character value. CALL SYMPUT uses a field width of up to 12 characters.
•CALL SYMPUTX left-justifies both arguments and trims trailing blanks. CALL SYMPUT does not left-justify the arguments, and trims trailing blanks from the first argument only. Leading blanks in the value of name cause an error.
•CALL SYMPUTX enables you to specify the symbol table in which to store the macro variable, whereas CALL SYMPUT does not. Symbol table is optional and the valid value of it is G, L, and F. If put G, then the macro variable will be stored in the global symbol table; else if specify L, SAS will store the macro in the local symbol table; else if not specify or specify F, SAS follows the same rules as like for Call Symput.
Thursday, September 3, 2009
Remove formats/informats/labels from a SAS dataset
set demo;
attrib _all_ label=''; *remove labels;
format _all_; *remove formats;
informat _all_; *remove informats;
run;
Tuesday, August 4, 2009
SASHELP / DICTIONARY
SAS catalogs
VCOLUMN / COLUMNS
Data set columns and attributes
VEXTFL / EXTFILES
Allocated filerefs and external physical paths
VINDEX / INDEXES
Data set indexes
VMACRO / MACROS
Global and automatic macro variables
VMEMBER / MEMBERS
SAS data sets and other member types
VOPTION / OPTIONS
Current SAS System option settings
VTABLE / TABLES
SAS data sets and views
VTITLE / TITLES
Title and footnote definitions
VVIEW / VIEWS
SAS data views
CAT/CATS/CATT/CATX Functions
Even though we can use the concatenation operator in combination with the STRIP, TRIM, or LEFT functions, these functions make it much easier to put strings together and if you wish, to place one or more separator characters between the strings.
One advantage of using the call routines than the functions is improved performance.
Note: *Call routine executes faster than the function… in specific…
CALL CATS:
To concatenate two or more strings, removing both leading and trailing blanks.
CATS () stands for concatenate and strip.
Just think the ‘S’ at the end of the CATS as “Strip Blanks”.
Syntax: CALL CATS (result, string-1<, string-n>);
Example:
A=”ABC”;
B=” CONSULTING”;
C=”INC “;
D=” TEAM “;
FUNCTION= RESULT
CALL CATS(RESULT, A,B)= "ABCCONSULTING"
CALL CATS(RESULT, A,B,C)= "ABCCONSULTINGINC"
CALL CATS(RESULT, "HELLO",D)= "HELLOTEAM"
CALL CATT:
To concatenate two or more strings, removing only trailing blanks.
Just think the ‘T’ at the end of the CATT as “Trailing Blanks” or “Trim Blanks”.
Syntax: CALL CATS (result, string-1<, string-n>);
Example:
A=”ABC”;
B=” CONSULTING”;
C=”INC “;
D=” TEAM “;
FUNCTION= RESULT
CALL CATT(RESULT, A,B) ="ABC CONSULTING"
CALL CATT(RESULT, A,B,C)= "ABC CONSULTINGINC"
CALL CATT(RESULT, "HELLO",D)= "HELLO TEAM"
CALL CATX:
To concatenate two or more strings and removing both leading and trailing blanks and places a single space, or one or more characters of our choice, between each strings.
Just think the ‘X’ at the end of the CATX as “add eXtra blank.”
Syntax: CALL CATX (separator, result, string-1<, string-n>);
Example:
A=”ABC”;
B=” CONSULTING”;
C=”INC “;
D=” TEAM “;
FUNCTION =RESULT
CALL CATX(" ",RESULT, A,B) ="ABC CONSULTING"
CALL CATX(" ,", RESULT, A,B,C) ="ABC,CONSULTING,INC"
CALL CATX(' / ', RESULT, "HELLO",D) ="HELLO/TEAM"
CALL CATX(' *** ', RESULT, "HELLO",D) = "HELLO***TEAM"
Wednesday, July 22, 2009
FMTLIB
libname library 'c:\sas\formats\lib';
proc format library=library fmtlib;
run;
Monday, July 13, 2009
Combining a Grand Total with the Original Data
Proc means data=videos;
var sales;
output out=summarydat sum(sales)=grandtotal;
*** Combine the grand total with the original data;
data videosummary;
IF _N_=1 THEN SET summarydat;
SET videos;
percent=sales/grandtotal * 100;
Output:
sales grandtotal percent
1930 12880 14.9845
2250 12880 17.4689
...
System Options for Debugging Macro Errors
When this option is on, SAS will issue a warning if you invoke a macro that SAS cannot find.
SERROR (default) | NOSERROR
When this option is on, SAS will issue a warning if you use a macro variable that SAS cannot find.
MLOGIC | NOMLOGIC (default)
When this option is on, SAS prints in your log details about the execution of macros.
MPRINT | NOMPRINT (default)
When this options is on, SAS prints in your log the standard SAS code generated by macros.
SYMBOLGEN | NOSYMBOLGEN (default)
When this options is on, SAS prints in your log the values of macro values.