












Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Community
Ask the community for help and clear up your study doubts
Discover the best universities in your country according to Docsity users
Free resources
Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors
This glossary provides definitions for key terms related to healthcare data analysis, covering concepts from statistical hypothesis testing to healthcare coding systems. It serves as a valuable resource for students and professionals in healthcare administration, data science, and related fields. The glossary includes definitions for terms such as alpha level, alternative hypothesis, ambulatory payment classification (apc), analysis of variance, and more.
Typology: Exams
1 / 20
This page cannot be seen from the preview
Don't miss anything!
Alpha level Correct Answers The level of Type I error that is deemed acceptable based on context in statistical hypothesis testing. See Type I error. Alternative hypothesis Correct Answers The complement of the null hypothesis that is to be tested using the appropriate statistical test. This hypothesis typically requires some action to be taken. Ambulatory payment classification (APC) Correct Answers The payment unit used in the hospital outpatient prospective payment system (OPPS). The classification is a resource-based reimbursement system. Analysis Correct Answers Reviewing and summarizing data for use in decision making. Analysis of variance Correct Answers The statistical tool used to compare more than two population means. The null hypothesis tests that all of the population means are equal. Auditing Correct Answers "The performance of internal and external reviews (audits) to identify variations from established baselines (for example, review of outpatient coding as compared with CMS outpatient coding guidelines)." Balanced design Correct Answers An experimental design where the number of subjects in each sample are the same for all
populations sampled. The term is relevant when performing an analysis of variance or a two sample t-test. Bell curve Correct Answers The shape of the normal distribution. The bell pealcs at the average and slopes down on both sides symmetrically. Binomial variable Correct Answers "A variable that talces only two values (such as yes or no; alive or dead). The probability of a yes or no is constant across all of the subjects, and the outcome of each subject is independent of the others." Case Mix Index (CMI) Correct Answers "The average relative weight of all cases treated at a given facility or by a given physician, which reflects the resource intensity or clinical severity of a specific group in relation to the other groups in the classification system; calculated by dividing the sum of the weights of diagnosis-related groups for patients discharged during a given period by the total number of patients discharged." Centers for Medicare and Medicaid Services (CMS) Correct Answers "The division of the Department of Health and Human Services that is responsible for developing healthcare policy in the United States, for administering the Medicare program and the federal portion of the Medicaid program, and maintaining the procedure portion of the International Classification of Diseases, ninth revision, Clinical Modification (ICD-9-CM) and International Classification of Diseases, tenth revision, Procedure Coding System (ICD-10-PCS); called the Health Care Financing Administration (HCFA) prior to 2001."
independent variable, then this value is the Pearson Correlation Coefficient squared." Confidence interval Correct Answers An interval that is centered at the sample estimate of a population value that may be calculated so that it has a preset probability of containing the population value. Confidence level Correct Answers The probability that a confidence interval includes the true value of a population statistic. Contingency tables Correct Answers A useful method for displaying the relationship between two categorical variables. Each category is displayed as rows or columns. The cells in the table represent the count of subjects with each category attribute. Convenience sampling Correct Answers A sampling technique where the selection of units from the population is based on easy availability and/or accessibility. Correlation Correct Answers A statistic that is used to describe the association or relationship between two continuous variables. Critical value Correct Answers The value that a test statistic must be larger than to conclude statistical significance. The value is based on the alpha level of the test and the distribution of the test statistic if the null hypothesis is true.
Current Procedural Terminology (CPT) Correct Answers "A comprehensive, descriptive list of terms and associated numeric and alphanumeric codes used for reporting diagnostic and therapeutic procedures and other medical services performed by physicians; published and updated annually by the American Medical Association." Data Correct Answers "The dates, numbers, images, symbols, letters, and words that represent basic facts and observations about people, processes, measurements, and conditions." Data dictionary Correct Answers "A descriptive list of the names, definitions, and attributes of data elements to be collected in an information system or database whose purpose is to standardize definitions and ensure consistent use." Data mining Correct Answers The process of extracting and analyzing large volumes of data from a database for the purpose of identifying hidden and sometimes subtle relationships or patterns and using those relationships to predict behaviors. Data or Proc Correct Answers Occur first in a SAS statement Database Correct Answers "A self-describing collection of integrated records. Each record has multiple attributes, and each attribute has one or more values per entry. The database is self describing because it contains a description of its own structure, and it is integrated because it has a relationship between the data items."
Exploratory data analysis Correct Answers The use of graphical techniques to identify and explore patterns in data. Frequency chart Correct Answers A graphical representation of a frequency distribution. Typically displayed as a bar chart where the height of the bars represent the frequency of observations in each category. Frequency distribution Correct Answers A table or graph that displays the number of times (frequency) a particular observation occurs. Healthcare Common Procedure Coding System (HCPCS) Correct Answers "An alphanumeric classification system that identifies healthcare procedures, equipment, and supplies for claim submission purposes; the three levels are as follows:
with the information they need to compare the performance of managed care plans. Hospital Compare Correct Answers A CMS-maintained website that reports the values of the quality indicators required for providers to participate in the Medicare value-based purchasing program. Hypothesis testing Correct Answers A statistical method that allows an analyst to measure the strength of evidence from the data to reject or not reject a research hypothesis. Independent Correct Answers The statistical term for the lack of relationship of two variables. Inferential statistics Correct Answers 1. Statistics that are used to make inferences from a smaller group of data to a large one.
MCC Correct Answers Major complication or comorbidity - patient requires higher level of resource intensity Mean Correct Answers A measure of central tendency that is determined by calculating the arithmetic average of the observations in a frequency distribution. Measures of central tendency Correct Answers The typical or average value that is descriptive of the entire collection of data for a specific population. Measures of variation Correct Answers Shows how widely observations are spread out around the measures of central tendency Median Correct Answers A measure of central tendency that shows the midpoint of a frequency distribution when the observations have been arranged in order from lowest to highest. Mode Correct Answers A measure of central tendency that consists of the most frequent observation in a frequency distribution. mRVU Correct Answers Malpractice expense component National Drug Codes (NDC) Correct Answers "Codes that serve as product identifiers for human drugs, currently limited to prescription drugs and a few selected over-the-counter products."
Nominal scale Correct Answers "Measurement scale that consists of categories with no natural or inferred order. Examples include diagnosis codes, clinical units, and color." Non probability sampling Correct Answers A sampling methodology where members of a sample are deliberately selected for a specified purpose. The sample is not selected at random and may not be used to make inference about the population. Null hypothesis Correct Answers "In hypothesis testing, the null hypothesis is typically the status quo or neutral position." One sample Z-test for proportions Correct Answers A hypothesis test for sample proportions that is used to test if the sample data collected supports the null hypothesis that the population proportion is equal to a fixed or standard value. One-sample t-test Correct Answers A hypothesis test for the sample mean that is used to test if the sample data collected supports the null hypothesis that the population mean is equal to a fixed or standard value. Ordinal scale Correct Answers "Measurement scale that consists of categories with a natural or inferred order. Examples include patient satisfaction scores, severity scores, and clinic visit level." Outpatient Prospective Payment System (OPPS) Correct Answers The Medicare prospective payment system used for hospital-based outpatient services and procedures that is
Population Correct Answers The universe of data under investigation from which a sample is taken. Predictive modeling Correct Answers A process used to identify patterns that can be used to predict the odds of a particular outcome based on the observed data. Primary data analysis Correct Answers The analysis of original research data by the researchers who collected them. Primary use Correct Answers Using data for the purpose it was collected. Probability sampling Correct Answers Each member of a population has a known probability of being selected for the sample. Procedural data Correct Answers The data obtained when procedures are coded via a procedural coding system. Qualitative Correct Answers Analysis of data that describes observations about or by a subject. The data is not naturally numeric and must be categorized prior to summary. Quantitative Correct Answers Analysis of data that is naturally numeric. Query Correct Answers A statement in SQL that defines the data to be selected or updated.
Quota sampling Correct Answers "A sampling technique where the population is first segmented into mutually exclusive subgroups, just as in stratified sampling, and then judgment is used to select the subjects or units from each segment based on a specified proportion." R squared Correct Answers See Coefficient of Determination. Random seed Correct Answers A preset starting point for a random number generator. Setting and recording the random seed ensures that the sample is reproducible. Range Correct Answers Differences between the largest and smallest values for a variable Rank Correct Answers Denotes a value's position in a group relative to other values that have been organized in order of magnitude. Ratio scale Correct Answers "Number data where zero has an interpretation and the values may be doubled or multiplied by a constant and still have meaning. Examples of ratio data include currency, length of stay, number of admissions, and age." Relationship database management system Correct Answers A database management system in which data are organized and managed as a collection of tables. Relative value unit (RVU) Correct Answers "A number assigned to a procedure that describes its difficulty and expense
Seed Correct Answers See Random seed. Simple linear regression Correct Answers A statistical technique used to characterize the linear relationship between a dependent variable and one independent variable. Simple Random Sampling Correct Answers Every member of the population has an equal change of being selected for the sample Simple random sampling Correct Answers The process of selecting units from a population so that each one has exactly the same chance of being included in the sample. Slope intercept form Correct Answers "A format for expressing a linear relationship as Y = BX + A. In this formula, B represents the slope of the line and A represents theY-intercept." Spearman's Rho Correct Answers A statistic that measures the strength of the linear relationship between two ordinal variables or one ordinal and one continuous variable. The statistic can range from -1 to +1. SPSS Correct Answers A software system used to perform statistical analysis. Standard deviation Correct Answers A measure of variability that describes the deviation from the mean of a frequency distribution in the original units of measurement; the square root of the variance.
Standard Deviation Correct Answers Shows how widely observations are spread out around the measures of central tendency; square root of variance Standard error Correct Answers "The standard deviation of a statistic. Typically, it is the sample standard deviation divided by the square root of the sample size." Standard normal distribution Correct Answers Normal distribution with a mean of zero and standard deviation of 1. The normal distribution is the basis for the bell curve. Standardized residuals Correct Answers Calculated by subtracting the average residual from each residual and dividing by the standard deviation of the residuals. Statistical Analysis System (SAS) Correct Answers A software system used to perform statistical analysis. Strata Correct Answers Subsets or groupings of subjects that are mutually exclusive and exhaustive. Each subject is assigned to one and only one group. Stratified Random Sampling Correct Answers Organizes the population into similar groups or stratifies the population by a set of criteria; random samples are selected within the groups Stratified random sampling Correct Answers The process of selecting the same percentages of subjects for a study sample as they exist in the subgroups (strata) of the population.
supports the null hypothesis that the two population means are equal. Two-sample Z-test for proportions Correct Answers A hypothesis test for sample proportions that is used to test if the sample data collected supports the null hypothesis that the two population proportions are equal. Type I error Correct Answers "Occurs when the null hypothesis is rejected, yet is true" Type I error Correct Answers A type of error in which the researcher erroneously rejects the null hypothesis when it is true. Type II error Correct Answers "Occurs when the null hypothesis is not rejected, yet is false" Type II error Correct Answers A type of error in which the researcher erroneously fails to reject the null hypothesis when it is false. Uniform Bill-04(UB-04) Correct Answers "The single standardized Medicare form for standardized uniform billing, implemented in 2007 for hospital inpatients and outpatients; this form will also be used by the major third-party payers and most hospitals. " Universe Correct Answers The set of all units that are eligible to be sampled.
Unstructured data Correct Answers Kind of data that includes narrative notes as well as images Unstructured data Correct Answers Non numeric, human- readable data. Examples include note fields in an ERR, images, and recorded transcripts Variability Correct Answers Difference between each value and every other value Variable Correct Answers A characteristic or property that may take on different values. Variance Correct Answers A measure of variability that gives the average of the squared deviations from the mean. wRVU Correct Answers Work component