1.

Have You Ever Done Any Edit Check Programs In Your Project, If You Have, Tell Me What Do You Know About Edit Check Programs?

Answer»

Yes I have done edit check programs .Edit check programs – Data validation.

1.Data Validation – proc means, proc univariate, proc freq.Data Cleaning – finding errors.
2.Checking for invalid character values.Proc freq data = patients;Tables gender dx ae / nocum nopercent;Run;Which gives frequency counts of unique character values.
3. Proc print with where statement to list invalid data values.[systolic blood pressure – 80 to 100][diastolic blood pressure – 60 to 120]
4. Proc means, univariate and tabulate to look for outliers.Proc means – min, max, n and mean.Proc univariate – five highest and lowest values[ stem leaf plots and box plots]
5. PROC FORMAT – range checking
6. Data Analysis – set, merge, UPDATE, keep, drop in data step.
7. CREATE datasets – PROC IMPORT and data step from flat files.
8. Extract data – LIBNAME.
9. SAS/STAT – PROC ANOVA, PROC REG.
10. Duplicate Data – PROC SORT Nodupkey or NoduplicateNodupkey – only checks for duplicates in BYNoduplicate – checks entire observation (matches all variables)For getting duplicate observations first sort BY nodupkey and merge it back to the original dataset and keep only records in original and sorted.
11.For creating analysis datasets from the raw data sets I used the PROC FORMAT, and RENAME and length statements to make changes and finally make a analysis data set.

Yes I have done edit check programs .Edit check programs – Data validation.

1.Data Validation – proc means, proc univariate, proc freq.Data Cleaning – finding errors.
2.Checking for invalid character values.Proc freq data = patients;Tables gender dx ae / nocum nopercent;Run;Which gives frequency counts of unique character values.
3. Proc print with where statement to list invalid data values.[systolic blood pressure – 80 to 100][diastolic blood pressure – 60 to 120]
4. Proc means, univariate and tabulate to look for outliers.Proc means – min, max, n and mean.Proc univariate – five highest and lowest values[ stem leaf plots and box plots]
5. PROC FORMAT – range checking
6. Data Analysis – set, merge, update, keep, drop in data step.
7. Create datasets – PROC IMPORT and data step from flat files.
8. Extract data – LIBNAME.
9. SAS/STAT – PROC ANOVA, PROC REG.
10. Duplicate Data – PROC SORT Nodupkey or NoduplicateNodupkey – only checks for duplicates in BYNoduplicate – checks entire observation (matches all variables)For getting duplicate observations first sort BY nodupkey and merge it back to the original dataset and keep only records in original and sorted.
11.For creating analysis datasets from the raw data sets I used the PROC FORMAT, and rename and length statements to make changes and finally make a analysis data set.



Discussion

No Comment Found