Statistical analysis is an important tool in the conduct of clinical, basic, and comes research in nursing and allied health.. Thischapter describes the 10 steps in building a research s
Trang 3Old Dominion University Norfolk, Virginia
Trang 4Acquisitions Editor: Hilarie Surrena
Managing Editor: Helene T Caprari
Senior Managing Editor / Production: Erika Kors
Director of Nursing Production: Helen Ewan
Art Director, Design: Holly McLaughlin
Art Director, Illustration: Brett MacNaughton
Manufacturing Coordinator: Karin Duffield
Senior Manufacturing Manager: William Alberti
Production Services / Compositor: Aptara, Inc.
Copyright © 2009 Wolters Kluwer Health | Lippincott Williams & Wilkins.
All rights reserved This book is protected by copyright No part of this book may be reproduced or transmitted in any form or by any means, including as photocopies or scanned-in or other electronic copies, or utilized by any information storage and retrieval system without written permission from the copyright owner, except for brief quotations embodied in critical articles and reviews Materials appearing
in this book prepared by individuals as part of their official duties as U.S government employees are not covered by the above-mentioned copyright To request permission, please contact Lippincott Williams &
Wilkins at 530 Walnut Street, Philadelphia PA 19106, via email at permissions@lww.com or via website at
lww.com (products and services).
9 8 7 6 5 4 3 2 1
Printed in the United States of America
Library of Congress Cataloging-in-Publication Data
Plichta, Stacey Beth,
1965-Statistics for nursing and allied health / Stacey B Plichta, Laurel S Garzon.
The authors, editors, and publisher have exerted every effort to ensure that drug selection and dosage set forth in this text are in accordance with the current recommendations and practice at the time
of publication However, in view of ongoing research, changes in government regulations, and the constant flow of information relating to drug therapy and drug reactions, the reader is urged to check the package insert for each drug for any change in indications and dosage and for added warnings and precautions This
is particularly important when the recommended agent is a new or infrequently employed drug.
Some drugs and medical devices presented in this publication have Food and Drug Administration (FDA) clearance for limited use in restricted research settings It is the responsibility of the health care provider to ascertain the FDA status of each drug or device planned for use in his or her clinical practice LWW.COM
Trang 5I would like to dedicate this book to my family for all of the love and support thatthey have given me, especially to my mother, Patricia Plichta, my husband, BillKellar, and my children, Jesse Carroll and Samantha Lee
I would also like to thank all of my graduate students, who over the years havehelped to shape this book I would like to give special thanks to two of my formerstudents, Dr Yan Zhang and Mr Paul Magnant, who spent many hours readingdrafts, developing homework exercises, and making this book more accessible tostudents My thanks also extend to another former student, YuQuin Zhao, whoprovided a diligent and thorough review of our earlier drafts Finally, I would like
to thank the editors at LWW, notably Helene Caprari, Michelle Clarke, andMargaret Zuccarini, who with much patience and support have seen this projectthrough to completion
Stacey B Plichta
I would like to thank Beth and Anna for their love and Dot for my strength
Laurel S Garzon
Dedications
Trang 7Professor and Chair
Edyth T James Department of Nursing
Columbia Union College
Takoma Park, Maryland
West Virginia University
Morgantown, West Virginia
Willa Marlene Doswell, PhD, RN, FAAN
Associate ProfessorUniversity of Pittsburgh School of NursingPittsburgh, Pennsylvania
Mary Beth Flanders, PhD, RN
Associate ProfessorFay Whitney School of NursingUniversity of WyomingLaramie, Wyoming
David Anthony Forrester, PhD, RN
ProfessorUniversity of Medicine and Dentistry of NewJersey
Newark, New Jersey
Simon Geletta, PhD
Assistant ProfessorDes Moines UniversityDes Moines, Iowa
Beth S Hembree, RN, MSN, DSN
ProfessorDirector of Graduate StudiesLurleen B Wallace College of NursingJacksonville State University
Jacksonville, Alabama
Trang 8James E Hodge, EdD
Professor of Mathematics
Mountain State University
Beckley, West Virginia
Saint Joseph’s College
Brooklyn, New York
Rosalie O Mainous, PhD, ARNP
Acting Associate Dean for Academic Affairs
Saint Joseph’s College
Patchogue, New York
University of Utah College of Nursing
Salt Lake City, Utah
Sarah Ransdell, PhD
Associate Professor
Nova Southeastern University
Fort Lauderdale, Florida
Patsy L Ruchala, DNSc, RN
DirectorOrvis School of NursingUniversity of NevadaReno, Nevada
Cynthia L Russell, PhD, RN
Assistant ProfessorUniversity of Missouri-Columbia Sinclair School
of NursingColumbia, Missouri
Mina D Singh, RN, PhD
Assistant ProfessorYork UniversityToronto, Canada
Ida L Slusher, RN, DSN
ProfessorEastern Kentucky UniversityRichmond, Kentucky
Sandra L Smith, PhD, APRN
Assistant ProfessorUniversity of UtahSalt Lake City, Utah
Darlene Steven, MHSA, PhD, PHN, RN
Professor and CoordinatorLakehead UniversityThunder Bay, Canada
Jessica L Thomson, PhD
Statistician, Adjunct ProfessorLouisiana State University Health SciencesCenter
Baton Rouge, Louisiana
Karen S Ward, PhD, RN, COI
Professor and Associate Director of OnlinePrograms
Middle Tennessee State UniversityMurfreesboro, Tennessee
vi Reviewer List
Trang 9Statistical analysis is an important tool in the conduct of clinical, basic, and comes research in nursing and allied health Nurses and allied health providerswith a focus on practice are best qualified to identify and develop research thataffects the delivery of care and patient outcomes Current concerns about thecost, quality, and equitable delivery (i.e., among all racial and ethnic groups) ofhealth care can be addressed by evidenced-based practices with measurable out-comes of care
out-Evidence-based practice is a process by which systematic inquiry is used toexplain or answer questions of interest Research provides a process for careful,organized, and thoughtful analysis and consideration of myriad evolving prob-lems Health care issues require an ongoing review and analysis of the congru-ency of financial issues, personnel issues, methods of care, and outcomes
A systematic research approach includes a step-by-step process of identifying
a problem or question, selecting the relevant variables, and then using the mostappropriate statistical method This process can overwhelm beginners, butapproaching a research project in an organized way with the knowledge of thebasic statistics provided in this text can lead to a new and exciting understanding
of how powerful statistics can be The choice of a particular statistical methoddepends on the type of question asked (e.g., Are two or more things being com-pared?), the variables selected (e.g., What are the important components of theproblem?), the level of measurement of the variables (e.g., What types of instru-ments will be used?), and the level of certainty that is required This text takes thereader through this process Readers will learn the language of statistics, how toorganize the data, and how to describe the data with simple statistics This textprovides a good beginning to understand what data can reveal about research
Preface
Trang 10DEVELOPING RESEARCH AND STATISTICAL METHODS
Nursing research began in Florence Nightingale’s Notes of Nursing (1859) She
described the factors that she believed affected the outcomes of soldiers in theCrimean war in a systematic way Her notes provided the basis for a review of thecare these soldiers received, which, in turn, stimulated changes in nursing careand public health
Much of the research in the decades to follow was research about the education
of nurses Sigma Theta Tau, the International Honor Society of Nursing, becamethe first organization to fund nursing research in the United States, awarding a
research grant in 1936 In the 1930s, the American Journal of Nursing began lishing nursing research studies In 1952, Nursing Research was created to share the
pub-developing body of nursing studies Today, this journal enjoys an international utation for being the foremost journal for peer-reviewed nursing research
rep-In the 1970s, the focus of research changed to the investigation of nursingpractice and the outcomes of nursing The increased rigor required to investigatepractice-related issues required an in-depth knowledge of research design and sta-tistical methods Peer review was required for publication as additional journals,
such as Research in Nursing and Health and Advances in Nursing Science, were
developed to publish these studies Clinical problems and issues related to practicewere identified as the foremost need for nursing research (Lindeman, 1975)
In 1986, the importance of nursing research was recognized with the lishment of the National Center for Nursing Research (NCNR), which is part ofthe National Institutes of Health This center provides funding for nursingresearch and trains nurse researchers The selection for funding developed into arigorous peer-reviewed process in which the design and statistical methods ofresearch projects are carefully reviewed The funding provided by the NCNR hasgrown to over $150 million annually Current trends in nursing research arefocused on evidenced-based practice (Huddleston, Berkheimer, Landis, et al.,2005; McCormick, Naimark, and Tate, 2006), increased rigor of studies (Im,Page, Lin, Tsai, and Cheng, 2004; Sterling and Peterson, 2005), multidisciplinaryresearch, more visibility for nursing research, and a focus on culture and healthdisparities (Peragallo, DeForge, O’Campo, et al., 2005)
estab-ORGANIZATION OF THE CONTENT
The Role of Research in Health Care
In Chapter 1, “Using Research and Statistics in Health Care,” the role of research
in health care is explored Research in nursing and allied health has contributed
viii Preface
Trang 11to the development of practice guidelines and outcome measures Examples ofresearch questions, designs, and specific statistical methods are discussed Thischapter describes the 10 steps in building a research study plan and includes therelationships among the research questions, study designs, and the selection ofthe appropriate statistics.
to organize and display descriptive data in a meaningful way SPSS is used toobtain frequency tables, descriptive statistics, and histograms In Chapter 3,
“Building Blocks for Using Inferential Statistics,” the process for making ences or decisions based on statistical findings is depicted Probability statistics isdiscussed, and the usefulness of comparing the data to the normal (Gaussian) dis-tribution is demonstrated Readers will understand the process of making ameaningful decision, using knowledge of sampling distribution, and estimatingmeans, z-scores, percentiles, and confidence intervals In Chapter 4, “InferentialStatistics: Finding Relationships in the Data,” hypothesis testing is discussed
infer-Power analysis, selection of p-values, and type I and type II errors are defined.Comparing Means and Medians
Chapters 5 through 8 spell out the ways in which groups may be compared.Chapter 5, “The Independent t-Test and the Mann-Whitney U-Test,” and Chapter
6, “The Paired t-Test and the Wilcoxon Matched-Pairs Test,” describe methodsfor comparing the means and medians of two unrelated or related groups.Chapter 7, “The One-Way ANOVA and the Kruskal-Wallis H-Test,” and Chapter
8, “The Repeated-Measures ANOVA and the Friedman’s ANOVA by Rank,”describe methods for comparisons among three or more groups
Examining Relationships Among Variables Using
Cross-Tabulation Tables and Correlation Coefficients
Chapter 9, “The Pearson and Spearman Correlation Coefficients,” and Chapter
10, “The Chi-Square Statistic and the McNemar Test,” describe the methodsused for examining relationships between continuous variables and relationshipsbetween categorical variables These methods include computing correlation
Trang 12coefficients and constructing cross-tabulation tables The cross-tabulation tablesare analyzed with the chi-square statistic, the Fisher’s exact test, and theMcNemar test.
Basic Model Building
Chapter 11, “Model Building With Linear and Logistic Regression,” provides anintroduction to the use of two multivariate models: multiple linear regression andmultiple logistic regression The discussion includes an assessment of the overallmodel and the interpretation of regression coefficients
Dissemination of Findings
The purpose of research is to answer a question in a rigorous and meaningful way.Chapter 12, “Writing for Publication,” explains the various methods for the dis-semination of results Methods for the presentation of findings in posters, oralpresentations, and journal articles are discussed
BibliographyBarnard, K E., Wenner, W., Weber, B., Gray, C., & Peterson, A (1977) Premature infant refo-
cus In: Miller P (Ed.), Research to practice in mental retardation, Vol 3: Biomedical aspects Baltimore: University Park Press
Huddleston, R., Berkheimer, C., Landis, S., Houck, D., Proctor, A., & Whiteford, J (2005).
Improving patient outcomes in an ambulatory infusion setting Journal of Infusion Nursing, 28 (3), 170-172.
Im, E O., Page, R., Lin, L C., Tsai, H M., & Cheng, C Y (2004) Rigor in cross-cultural
nurs-ing research International Journal of Nursnurs-ing Studies, 41 (8), 891-899.
Lindeman, C A (1975) Delphi survey of priorities in clinical nursing research Nursing Research, 24, 434–441.
McCormick, K., Naimark, B., & Tate, R (2006) Uncertainty, symptom distress, anxiety and
functional status in patients awaiting coronary artery bypass surgery Heart and Lung, 35
(1), 43–45.
Peragallo, N., DeForge, B., O’Campo, P., Lee, S., Kim, Y J., Cianelli, R., & Ferrer, L (2005).
A randomized clinical trial of an HIV-risk reduction intervention among low-income
Latina women Nursing Research, 54, 108–118.
Sterling, Y M., Peterson, J W (2005) Lessons learned from a longitudinal qualitative family
systems study Applied Nursing Research, 18 (1), 44–49.
Trang 135 The Independent t-Test and the Mann-Whitney U-Test: Comparing
6 The Paired t-Test and the Wilcoxon Matched-Pairs Test:
7 The One-Way ANOVA and the Kruskal-Wallis H-Test: Comparing the
8 The Repeated-Measures ANOVA and the Friedman’s ANOVA by Rank:
Trang 14xii Contents
9 The Pearson and Spearman Correlation Coefficients:
10 The Chi-Square Statistic and the McNemar Test:
Appendix A Percent of Total Area of Normal Curve Between a z-Score and the Mean 385
Appendix G Critical Values of Dunn’s Q for Nonparametric Multiple Comparison Testing 399 Appendix H Exact Distribution of the Friedman’s r
2
for the Friedman’s ANOVA
Trang 151
Using Research and
Statistics in Health Care
Learning Objectives
After studying this chapter, you should be able to:
1 Understand the role of research in developing knowledge for use in evidence-based practice guidelines
2 Discuss several ways that research can help policymakers
3 Describe the differences between descriptive and inferential statistics
4 Compose a study plan for the collection and analysis of data
Nurses were at the forefront of the movement to use statistics to improve health care For example, Florence Nightingale (1820–1910) used data from British army files to show how most of the deaths in the British army during the Crimean War (1853–1856) were not caused by direct combat but by ill- nesses contracted off the field of battle or as a result of unattended wounds.
Her statistical analyses convinced the British government to maintain field hospitals and supply nursing care to soldiers Nightingale passed her passion for statistics on to her namesake, Florence Nightingale David, the eldest daughter of her closest friends Florence Nightingale David (1909–1995) became a statistician and worked under Karl Pearson She produced the first edition of Tables of the Correlation Coefficient in 1938 During World War II,
S tatistics are human beings with the tears wiped off.
■ Paul Brodeur, Outrageous Misconduct (1985).
Trang 16she used statistical models to help England prepare for German bombing raids David later left England for the United States and founded the Statistics Department at the University of California, Riverside in 1970 (Salsburg, 2001).
UNDERSTANDING THE ROLE OF EMPIRICAL RESEARCH
Nurses and allied health professionals need a solid understanding of how empiricalknowledge is generated because evidence-based practice has become the standard
by which clinical guidelines are produced (Andrews and Redmond, 2004;McNaughton et al., 2004; Polit and Beck, 2008; Stevens, 2001) The widespread use
of clinical guidelines since the 1980s has led to significant improvements in the comes of health care (Ahlqvist et al., 2006; Brooks, 2004; Penney and Foy, 2007).These guidelines depend on a systematic review of the research evidence (Stevens,2001), which, in turn, requires a sound understanding of statistics and researchmethods (Klardie, Johnson, McNaughton, and Meyers, 2004; Meyers, Johnson,Klardie, and McNaughton, 2004) The Cochrane Collaboration produces the largestcollection of clinical guidelines (Cochrane Collaboration, 2007) This internationalnonprofit organization was founded in 1993 to develop and disseminate systematicreviews of health care interventions In the United States, the U.S PreventiveServices Task Force (U.S Preventive Services Task Force, 2007) has assumed pri-mary responsibility for developing evidence-based guidelines for health care
out-An example of an evidence-based clinical guideline concerns the use of bedrest for back pain (Hagen et al., 2005) Through a systematic review of the litera-ture, Hagen and coworkers concluded that for people with acute low back pain,bed rest is less effective than staying active They also concluded that for patientswith sciatica, there is little or no difference in outcomes among those who stayactive and those who rest in bed Advances in nursing and allied health depend onpractitioners such as Hagen and coworkers who develop guidelines based onempirical research These studies, which are conducted by fellow clinicians, aremost likely to be relevant to developing clinical guidelines (McCormack, 2003).Taking a leading role in research, however, demands an understanding of how toconduct empirical research, including competency in statistics
Research can also help policymakers to identify health care problems thatmay lend themselves to policy solutions For example, the ongoing nursing short-age is predicted to last for 10 years or more because of a number of demographic,environmental, and professional factors (Auerbach, Buerhaus, and Staiger, 2007).Health care organizations have responded to this shortage by encouraging theimmigration of foreign nurses and calling for more graduates from nursingschools (Brush, Sochalski, and Berger, 2004) It appears, however, that somehealth care organizations are less concerned with the educational level of nurses
2 Statistics for Nursing and Allied Health
Trang 17and more concerned with simply having more nurses of any educational ground (e.g., ADN, diploma, BSN).
back-The question suggested by this solution to the nursing shortage—Does thelevel of education of nurses in a given hospital affect patient outcomes?—wasstudied by a team of nurse, medical, and sociologic researchers, and the results
were published in the Journal of the American Medical Society after extensive
peer review (Aiken, Clarke, Cheung, Sloane, and Silber, 2003) The findings ofthis study indicate that a 10% increase in the proportion of hospital nurses withbaccalaureate degrees is associated with a 5% decline in mortality after commonsurgical procedures The researchers used advanced statistical models to accountfor many factors, other than the nurses’ education, that might explain the varia-tion in hospital death rates In addition to the educational preparation of thesenurses, the study also took into account how ill patients were on admission, thesize of the hospital, the technological capacity of the hospital, whether or not itwas a teaching facility, the board certification of the attending surgeons, andpatient-to-nurse staffing ratios Even after statistically controlling for all of thesefactors, there was a clear positive effect of nurse education level on quality ofcare These findings determined that the level of nursing education is critical andthat increasing the number of nurses without concern for educational level hasserious implications for critically ill patients
TYPES OF RESEARCH STUDIES AND STATISTICS
Research studies serve many different purposes Polit and Beck (2008) describedthe four main purposes of empirical research: description, exploration, explana-tion, and prediction and control (considered one category) In general, empiricalstudies use two different categories of statistics: descriptive and inferential
Descriptive statistics are simply numerical or graphical summaries of data,
including charts, graphs, and simple summary statistics such as means and dard deviations, used to describe characteristics of a population sample
stan-Inferential statistics are statistical techniques (e.g., chi-square test, the t-test, the
one-way ANOVA) that allow conclusions to be drawn about the relationshipsfound among different variables in a population sample The one-sample z-test,another example of inferential statistics, allows a comparison of sample data with
a larger population
Descriptive Studies and Descriptive Statistics
Studies whose primary purpose is descriptive and explorative simply describe uations and events These studies use descriptive questions, such as: What is the
Trang 18sit-marital status of people in the United States? or What is the average length ofstay in the hospital after being admitted for an asthma attack? Descriptive sta-tistics are typically used to analyze data from this type of study (see Chapter 2for more information about descriptive statistics) Table 1-1 illustrates the use
of descriptive statistics to answer the question about the marital status ofwomen in the United States by using data from the U.S Census Bureau (U.S.Census Bureau, 2000) As shown in Table 1-1, in the year 2000, approximately52.1% of women were currently married, 24.1% had never married, 10.8%were divorced, 2.5% were separated, and 10.5% were widowed (U.S CensusBureau, 2000)
Explanatory Studies and Inferential Statistics
Studies whose primary purpose is explanatory elucidate the relationships amongvariables These studies are typically more complex than descriptive studies.Their questions and lines of inquiry are often based on established theories fromthe research literature Explanatory studies depend on inferential questions, suchas: Are women who are sedentary during the third trimester of pregnancy morelikely to have a cesarean section (C-section) than women who exercise regularlyduring the third trimester? or Are people with health insurance more likely tohave a longer hospital stay after being admitted for an asthma attack than peoplewithout health insurance? Explanatory studies do not necessarily attempt toestablish causality but often attempt to understand how variables are related toeach other Inferential statistics are used to examine the relationship among vari-ables in an explanatory study (see Chapters 4 to 11 for more information aboutinferential statistics)
An example of an explanatory study is one conducted by Ludwig-Beymer andGerc (2002), who examined the relationship between exercise behavior andreceiving the flu vaccine in a sample of 999 health care workers Table 1-2 shows
4 Statistics for Nursing and Allied Health
TABLE 1-1 MARITAL STATUS OF U.S WOMEN
AGE 15 YEARS AND OLDER
Trang 19the data from this study in a cross-tabulation table A cross-tabulation table is away to graphically display the relationship between two variables Table 1-2 showsthat 46.8% of the health care workers who exercised regularly received theinfluenza vaccine compared with 53.2% of those who did not exercise regularly.Even though these numbers are not identical, chi-square analysis indicates thatthey are not statistically different, meaning that the two groups did not signifi-cantly differ in their likelihood of obtaining the influenza vaccine, and any appar-ent differences are attributable to chance (see Chapter 10 for more informationabout chi-square analysis).
Prediction and Control Studies and Inferential Statistics
Prediction and control studies seek to determine which variables are predictiveand to determine causality (e.g., one event causes another to happen) Predictionand control studies are typically quasi-experimental or experimental studies inwhich researchers introduce an intervention True experimental designs includerandom selection, an intervention, one or more control groups that do notreceive the intervention, and random assignment of the study participants toeither the control group(s) or the intervention group Quasi-experimentaldesigns are similar to experimental designs except that they lack one or more ofthe following: random selection into the study, a true control group, or randomassignment to the intervention or control groups (Polit, 2008) In health-relatedresearch, quasi-experimental designs are often used As with explanatory studies,quasi-experimental studies typically use inferential statistics to analyze the dataand answer the research questions
TABLE 1-2 RELATIONSHIP OF REGULAR EXERCISE TO OBTAINING A FLU
VACCINATION IN 999 HEALTH CARE WORKERSRECEIVED FLU VACCINATION
Note Chi-square p 18 (not statistically significant).
Source: Data from Ludwig-Beymer P, & Gerc SC (2002) An influenza prevention campaign: the employee
perspective Journal of Nursing Care Quality, 16 (3), 1–12.
Trang 20TEN STEPS TO BUILDING A STUDY PLAN
All studies, no matter what the purpose, need to be well-planned For example, instudies in which many variables are measured, it is easy to lose track of the initialpurpose of the study and to generate “results” that appear to be useful Theseresults, however, are meaningless unless they exist in the context of an organizedline of inquiry Box 1-1 lists some of the common mistakes researchers makewhen embarking on research projects These mistakes are often made when there
is no study plan or when the plan is insufficiently detailed Papers resulting fromstudies that have inadequate study plans often lack focus and clarity Althoughseveral well-known methods for writing a study plan are available, they all followthe same basic principles A typical approach is described below
A study plan is a written presentation of how the researcher is going to obtainand analyze the numerical data needed to answer the research questions A goodstudy plan keeps the analysis focused and relevant It serves as the basis for theintroduction and methods section of research papers after the data have been col-lected and analyzed A study plan can also serve as the basis for the first threechapters of a dissertation or thesis In addition, most grants require study planssimilar to the one presented here
The outline that a study plan should follow is summarized in Box 1-2 anddescribed on the following page The method presented here is fairly standard,and similar ones can be found in guides to planning research (Ogden andGoldberg, 2002; Wood, 2006) A study plan begins with a statement of the prob-lem that the study is trying to solve (i.e., the purpose of the study) and a short
6 Statistics for Nursing and Allied Health
EIGHT COMMON MISTAKES IN RESEARCH
1 Undertaking a research project without reviewing the existing literature on the subject
2 Collecting data without a well-defined plan, hoping to make sense of it afterwards
3 Trying to fit meaningful research questions to existing data
4 Defining terms in general or ambiguous language
5 Failing to base research on a sound theoretical foundation
6 Failing to make explicit and clear the underlying assumptions
7 Failing to recognize the limitations of the approach
8 Failing to anticipate rival hypotheses that would account for findings and that challenge
interpretations and conclusions
Source: Courtesy of Dr Brenda Nichols, Dean of the College of Arts and Sciences, Lamar University, Beaumont, Texas.
BOX 1-1
Trang 21description of the significance of the problem The statement of purpose is theguiding force behind the entire research project, and the study should flow from
it A study plan also needs a theoretical or conceptual framework on whichresearch questions and hypotheses are based This framework presents a struc-tured way of thinking about the interrelationships of the variables Researchquestions are either very specific or broadly conceptual The hypotheses, how-ever, must be very specific because they provide the guide for the analysis of thedata The study plan should define key terms and variables, provide a briefdescription of the research design, and describe the sample and how it wasobtained The plan should also state the statistical techniques that will be used totest each hypothesis A good study plan lists any major assumptions and limita-tions of the study being described And finally, a good study plan contains a briefdescription of how the data obtained from the study will be disseminated
Statement of the Problem and Its Significance
A study plan starts with a clear explanation of the purpose of the study and the nificance of the problem to be studied This explanation should include the rea-sons why the study is important and how the study fits into the existing body ofresearch This section orients researchers and interested readers to the study.The statement of the problem should be no more than two or three sentences,and should clearly articulate what the study is seeking to accomplish The ratio-nale for the study should include a brief overview of the epidemiology of theproblem being addressed It should also include a discussion of the monetary andnonmonetary costs of the problem to society, to the health care system, and topeople who have the problem It should also provide a review of other studies in
sig-TEN-STEP STUDY PLAN
1 Statement of the problem and its significance
2 Theoretical or conceptual framework
3 Research questions to be answered by the study
4 List of hypotheses to be tested
5 Definitions of key terms and variables
6 Description of the research design
7 Description of the sample and how it was obtained
8 Description of the planned statistical analysis
9 Statement of assumptions and limitations
10 Dissemination plan
BOX 1-2
Trang 22the literature that have examined similar issues The rationale should then explainthe weaknesses in the literature (e.g., what is not well documented, what is notknown at all) and how the current study will add to the existing knowledge base.
An example of a statement of the problem follows:
The purpose of the study on maintaining physical behavior is to examine the development of exercise habits over a 12-week period and to test the ability of the theory of planned behavior to predict actual participation in physical activity (Armitage, 2005).
Theoretical or Conceptual Framework
All studies need to have an underlying framework that organizes the analysis
by stating how all of the variables are expected to relate to one another In ing a thesis or dissertation, this is accomplished by using (and testing) a theo-retical model from the research literature Theses and dissertations typicallytest and draw conclusions about the validity of theories that already exist in theliterature Even beyond the thesis and dissertation stage, models are vital.Models can keep the analyses organized and coherent They provide a logicalframework that connects the variables to each other and that helps to establishthe relative importance of each variable in predicting the outcome For exam-ple, when testing an intervention, the logical model being tested is that theintervention will affect health status or health behavior variables Other fac-tors, such as age and gender, are also expected to affect health status A goodmodel provides a framework that outlines the expected relationships amongdifferent variables
writ-Numerous theories are used in nursing research Many of these theories aredisease specific However, several theoretical models are useful both in nursingand across disciplines The three models discussed here are examples of theoriesthat can facilitate empirical work: Andersen’s model of health care use, the theory
of unpleasant symptoms, and the theory of planned behavior
Anderson’s Model of Health Care Use
Andersen’s model of health care use postulates that the use of health services is afunction of the perceived need for care, predisposing factors (e.g., cultural fac-tors), and factors that enable patients to obtain care (e.g., knowledge, insurance)(Andersen, 1995) For example, in using Andersen’s model to study condom useamong adolescents, having easy access to condoms is viewed as an “enabling fac-tor.” The effect of easy access to condoms on condom use would be examined inthe context of other factors discussed in the model, such as predisposing factors(e.g., age, gender) and need factors (e.g., adolescents’ perceived need for contra-ception)
8 Statistics for Nursing and Allied Health
Trang 23Theory of Unpleasant Symptoms
The Theory of Unpleasant Symptoms is another useful theory in research related
to health care (Lenz, Pugh, Milligan, Gift, and Suppe, 1997) This theory lates that symptoms do not occur in isolation, meaning that the effects of symp-toms that occur at the same time are multiplicative and not simply additive Thisnursing model includes four dimensions for each symptom: severity or intensity,distress, quality, and time Two additional components have been recently added
postu-to the model: facpostu-tors that influence the symppostu-tom experience and the quences of the symptom experience In general, the model postulates thatfatigue, pain, and impaired functioning are related to experiencing symptom clus-ters Among the many studies that have supported this model are studies of post-partum fatigue in military women (Rychnovsky, 2007), postoperative pain in chil-dren recovering from tonsillectomies (Huth and Broome, 2007), and fatigue inhemodialysis patients (Liu, 2006) Based on this model, Gift and coworkers(2003) examined the effect of time on symptom clusters in patients with lung can-cer The model supported the fact that the symptom cluster was predicted toremain over the course of having lung cancer The symptom cluster over time was
conse-an independent predictor of the patient’s death
Theory of Planned Behavior
A theory often used in the research of individual health behavior and behavioralintentions is the theory of planned behavior (Ajzen, 1991) According to this the-ory, the performance of any behavior depends on behavioral intention Behavioralintention is viewed as being dependent on behavioral beliefs (e.g., attitudetowards the behavior), normative beliefs, and control beliefs Attitudes aredescribed as being the disposition toward the behavior Normative beliefs arebeliefs about the expectations of others, and control beliefs are beliefs about thefactors that may help or hinder the performance of the behavior The frameworkfor the study of physical activity discussed earlier by Armitage (2005) is based onthis theory This study found that behavioral beliefs, normative beliefs, and con-trol beliefs all contribute to physical activity behavior
Research Questions to Be Answered by the Study
Research questions should stem directly from the statement of the problem andthe theoretical framework on which it is based It is also important to groundresearch questions in the existing literature (This is where the significance sec-tion can be useful.) Research questions should be clear about the relationshipsthat are expected Additionally, the research questions must relate directly to thedata that will be collected by the researcher For example, it does not make sense
to have a research question that asks about the relationship of drinking to exercise
Trang 24behavior if data about alcohol use are not collected Research questions asked inArmitage (2005) are listed below.
■ Does attitude toward exercise affect participation in physical activity?
■ Does the extent to which participants perceive themselves as able to exercise (perceived behavioral control) affect participation in physical activity?
■ Do subjective norms affect participation in physical activity?
■ Does intention to exercise predict physical activity?
List of Hypotheses to Be Tested
A hypothesis is a tentative prediction or explanation of the relationship betweentwo or more variables The purpose of a hypothesis is to translate problem state-ments and research questions into predictions of expected outcomes Thehypotheses, therefore, must stem directly from the research questions and theproblem statement and be grounded in the theoretical framework that is chosen
A hypothesis states the expected relationship between the variables This tionship can be either an association in which no causal effect is presumed or acausal relationship in which one variable (the independent variable) is expected
rela-to cause changes in the other variable (the dependent variable) In most cases, thehypotheses stated are directional, that is, they are specific statements of the direc-tion of the relationship between two variables The hypothesized direction of therelationship can be either direct or inverse In a direct relationship, the depen-dent variable increases as the independent variable also increases In an inverserelationship, the dependent variable decreases in value as the independent vari-able increases in value The “direction of the relationship” should be based on thechosen theory and findings from previous research If there are no previous find-ings, it is permissible to base hypotheses on expert opinion or a sound rationale
In a research plan, the hypotheses serve as the guide for data analysis Thereshould be a specific hypothesis for each relationship that is being tested In a dis-sertation, it is not unusual, for example, to have 30 or 40 hypotheses that are beingtested Of course, many of these can be grouped under headings (e.g., sociodemo-graphic characteristics, physical activity) New researchers should be especially
10 Statistics for Nursing and Allied Health
Trang 25careful to write down each relationship that they plan to test to stay organized andfocused Hypotheses tested by Armitage (2005) include the following:
■ Those who perceive themselves as being able to exercise will be more likely to engage in physical activity than those who do not.
■ Those who perceive subjective norms that are supportive of exercise are more likely to exercise.
■ Those with a positive behavioral intention to exercise are more likely to do so than those without a positive behavioral intention to exercise.
Definitions of Key Terms and Variables
In a research plan, it is important to clearly define key terms and variables Terms arebest defined when they first appear so that readers do not initially make assumptionsabout definitions only to discover later that different definitions apply It is especiallyimportant to define terms that readers outside the field of study may not understand.And finally, it is important to include units of measure in the definitions
The variables used in a study must stem directly from the hypotheses.Specifically, variables should measure most, if not all, of the constructs discussed
in the conceptual or theoretical model Furthermore, it should be clearly cated which variables are independent and which are dependent Independentvariables are those that are manipulated (i.e., by the intervention) or that mayaffect the outcome Variables such as age, income, and preexisting health condi-tions are usually considered independent variables Dependent variables arethose that are expected to change in response to the interventions, such as healthstatus, number of visits to the doctor, and costs of inpatient stay Some examples
indi-of variables used by Armitage (2005) include the following:
■ Self-reported physical activity is defined as the participants’ self-report of how many times they participated in physical activity in the past 3 months (rated
on a 7-point scale that ranged from never to frequently).
■ Perceived behavioral control is defined by averaging responses for four Likert scale items (responses ranged from 1 to 7; high scores indicate greater levels of perceived behavioral control) The scale items were:
1. To what extent do you see yourself as being capable of participating in ical activity? (incapable–capable)
phys-2. How confident are you that you will be able to participate in regular cal activity? (not very confident–very confident)
physi-3. I believe I have the ability to participate in regular physical activity nitely do not–definitely do)
(defi-4. How much personal control do you feel you have over participating in lar physical activity? (no control–complete control).
Trang 26regu-Description of the Research Study Design
Every study plan must include a description of how the data will be collected (i.e.,the design of the study) If secondary data (i.e., existing data) are used, it is likelythat the data collection process is described in detail elsewhere, in which case asummary of how the data were collected is provided with a reference to the orig-inal source When collecting original data, however, the study plan must describe
in detail exactly how this will be accomplished This section also contains a briefoverview of the research design as a refresher For more detailed information,standard reference texts can be helpful (Babbie, 2007; Cook and Campbell, 1979;Polit and Beck, 2008)
Research design is the art and science of conducting studies that maximizereliability and validity of the findings and minimize bias The research design alsoensures that the relationship between the dependent and independent variablescan be stated with as much certainty as possible In addition, research designenables the researcher to use statistical analysis when examining data Each sta-tistical technique makes assumptions about the data, and a good study plan max-imizes the extent to which the data meet these assumptions Details of theseassumptions are explained later in the text when statistical techniques are dis-cussed The choice of design depends on several factors, including the type ofproblem, research setting, and available resources Research designs commonlyused in health services research include observational studies, quasi-experimen-tal studies, and experimental studies
Observational Study Designs
Observational studies are those in which a phenomenon is simply observed and
no intervention is instituted They are appropriate when the purpose of the study
is descriptive or correlational The two main types of observational studies arecross-sectional studies and longitudinal studies Cross-sectional studies involvethe collection of data at one point in time; the samples can be random or useintact groups or convenience samples They provide only indirect evidenceabout the effects of time; causal statements cannot be made from analyses ofcross-sectional data Longitudinal studies (i.e., prospective studies) are designed
to collect data at more than one point in time They are appropriate when causallinks need to be established between independent and dependent variables
Quasi-Experimental and Experimental Study Designs
Quasi-experimental and experimental study designs differ from observationalstudy designs in that the researcher is an active agent in the experimental work.Both types of designs are prospective and involve measurements taken during at
12 Statistics for Nursing and Allied Health
Trang 27least two separate points in time, usually a pretest and a posttest measurement.Both types of designs involve a treatment or some type of intervention, an out-come measure, and a comparison group (i.e., control group) from which changecan be inferred and attributed to the treatment Controlled experiments alsoinvolve the random selection of subjects from the population of interest and therandom assignment of subjects to different treatment and control groups Quasi-experimental designs may lack random selection, random assignments, or both.Experimental designs differ from quasi-experimental designs primarily in theamount of control that the experimenter has over external sources of bias and ran-dom error, both of which might call into question the validity and reliability of theresults Evidence from experimental studies is considered to be stronger than evi-dence from quasi-experimental studies A brief discussion of a study design based
on Armitage’s research (2005) follows:
This is an observational, longitudinal study of adult customers of a newly opened fitness center All new members were invited to participate in the study Those agreeing to participate completed a self-administered ques- tionnaire at baseline and an identical questionnaire 3 months later.
Description of the Sample and How It Was Obtained
The mechanism by which participants are selected to be in a study is a critical part
of the research design and is such a complicated topic that it deserves its own tion in the study plan In general, studies attempt to find participants who repre-sent all members of the population of interest because it is generally impossible
sec-to gather data from the entire population For example, researchers interested inobtaining information about women with gestational diabetes may only be able to
gather data from one clinic that serves such women Sampling is the process of
selecting a portion of the population to represent the entire population A majorconsideration in assessing a sample is to make sure it represents the populationbeing studied It is important to state exactly what the target population is, that is,the group to which the researchers want to generalize the study results Evenwith the use of an existing dataset, the study plan should contain an explanation ofhow people were chosen to be in the study It should also contain a brief descrip-tion of the sociodemographic characteristics of the sample
Sampling can be either random (i.e., probability) or nonrandom (i.e., probability) In nonrandom sampling, subjective judgment is used to decide who
non-is chosen for the sample One dnon-isadvantage non-is that it non-is difficult to determinewhether the sample includes members from all relevant segments of the popula-tion Types of nonrandom samples include convenience sampling, snowball sam-pling, and quota sampling Random sampling is the selection of a group ofsubjects from a population so that each individual is chosen entirely by chance
In equal random sampling, each population member has the same probability
Trang 28of being chosen to be in the sample This is also known as “self-weighted” pling Other more complicated random-sampling procedures are sometimes used
sam-in nationwide studies The data from these studies must be analyzed with ized software that accounts for the differences in the probability of each person’sbeing selected into the study In general, a random sample reduces the chances ofbias in the study and increases the external validity A short description of sam-pling from Armitage’s research (2005) follows:
special-■ All study participants were recruited from a single, newly opened fitness ity in the south of England.
facil-■ The final sample consisted of 94 new adult customers.
■ Customers were 56% male, and the average age was 37.57 years (range, 18–65 years).
Description of the Statistical Analysis
Statistical analysis of the data occurs in three stages First, the data must be
“cleaned.” Second, descriptive statistics are used to describe the sample in terms
of demographic characteristics Descriptive statistics are also used to examine anddescribe the dependent and independent variables Third, each hypothesis islisted with the inferential test that will be used to test it The actual choice of testdepends on the sample size, the measurement scale of the variables in the hypoth-esis, and the distribution of those variables The tests chosen initially in the studyplan may change as more information about the nature of the variables is obtained
Cleaning the Data
Cleaning the data involves making certain that all of the variables have valid andusable values This step is completed by running frequencies on every variable(see Chapter 2) and examining those frequencies carefully for valid values,unusual values, large amounts of missing data, and adequate variability Forexample, if the variable gender has a value of 1 for men and a value of 2 forwomen, any cases that listed a value of 3 for gender would have to be examinedand explained The frequency distribution of each variable is then checked forunusually large or small values to be sure that they were accurately entered intothe database For example, if a participant’s weight is listed as 890 lb, it should bechecked to see if it is not actually 89 lb because of a data entry error In somecases, the paper copies from which the data were entered can be checked foraccuracy In other cases, especially if the study is based on secondary data, thedata cannot be checked All invalid and out-of-range values must then be defined
as “system missing” so they will not be included in the final data analysis
The next step in cleaning data is to check the variables with missing data Iftoo many participants are missing values for a given variable, the variable may not
14 Statistics for Nursing and Allied Health
Trang 29be usable Additionally, the variables must be examined for adequate variabilitywithin each variable If almost everyone answers the same way to a specific vari-able (e.g., if 98% of the group is female), then that variable cannot be used dur-ing the analysis because it really is not a “variable.”
Describing the Sample
The second step in data analysis is to describe the sample characteristics usingdescriptive statistics It is common practice to create tables that display samplesociodemographic characteristics, such as age, gender, ethnicity, and educationlevel These descriptions help readers understand the study population Theoverall values and distribution of the key independent and dependent variablesare also described (see Chapter 2 for details)
Inferential Statistics Used to Test Each Hypothesis
The third step in data analysis is to list the inferential statistics that will be used totest the hypotheses The hypotheses, including the independent and dependentvariables in each hypothesis, should be clearly stated The exact test performed totest each hypothesis depends on the research design, sample size, distribution ofthe variables (i.e., normal vs nonnormal), measurement scale of the variables inthe hypothesis (nominal, ordinal, interval, ratio), and type of comparisons to bemade In general, nonparametric statistics are used for small sample sizes and forvariables that are not normally distributed, and parametric statistics are used foranalyses of large sample sizes with normally distributed variables These tech-niques are described in Chapters 4 to 10 Some study plans may include multi-variate statistical analyses to control for confounding variables and to eliminaterival hypotheses These are described briefly in Chapter 11
Statement of Assumptions, Limitations, and Delimitations
Every study has assumptions, limitations, and delimitations, all of which must be
stated explicitly Assumptions are statements that are taken to be true even
though the direct evidence of the truth is either absent or not well documented.For example, in the exercise study, it is assumed that participants were honest
about their level of physical activity Limitations are weaknesses or handicaps that
potentially limit the validity of the results For example, a limitation of the cal activity study is that it uses an intact group (i.e., clients of a single fitness facil-ity) rather than a random sample of clients from several facilities This may limitthe ability of the study to be generalized Note that common limitations in “real-life” outcomes research are often small sample sizes (i.e., samples with fewer than
physi-100 participants), poor response rates, poor follow-up rates, lack of random selection,
Trang 30lack of random assignment, and lack of diversity in the sample Delimitations are
boundaries in which the study was deliberately confined (e.g., the physical ity study focuses on adults rather than on adolescents and adults)
activ-Dissemination Plan
A study is not truly complete until the knowledge gleaned from the work is seminated for use Arguably, the most important part of a research project is shar-ing the knowledge obtained from the project Information can be shared in manydifferent forms First and foremost, the results of the study should be shared withthe sites at which the study was held to help the sites improve their clinical prac-tice The results can also be more widely publicized throughout the local areathrough grand rounds, regional conferences, and newsletters
dis-A higher level of dissemination occurs when the paper is presented in reviewed arenas (see Chapter 12) These forums include statewide conferences,national conferences, and peer-reviewed publications Statewide conferences(e.g., annual meeting of the Virginia Public Health Association) typically providespace in which researchers can present their results in a poster format Nationalconferences (e.g., annual meeting of the American Public Health Association)typically provide opportunities to present research as posters, oral presentations
peer-on panels, and roundtables Publishing in a peer-reviewed journal, however, vides the best way to widely disperse the results of a study Most peer-reviewedjournals are indexed in one or more electronic databases (e.g., Medline orCINALH) These databases are commonly available at university libraries and onthe Internet Indexing in these databases makes the research available to millions
pro-of researchers and policymakers around the world
16 Statistics for Nursing and Allied Health
Trang 31CRITICAL THINKING CONCEPT REVIEW
1 Find a peer-reviewed research article and do the following:
a Write the problem statement
b Name the theoretical model used or briefly describe the overall tual model
concep-c Write out the main research questions What is the rationale for the question?
d List the main hypotheses that the study is testing
e Define the dependent variable and the main independent variables
f Briefly describe the research design
g Describe the sample (e.g., size, sociodemographic characteristics) and how
it was obtained
h List the statistics used to test the hypotheses
i Identify the main assumptions and limitations
2 Describe three studies that were critical to the practice of nursing or anallied health profession in the United States
3 Choose a topic in which you are interested and find (or conceptualize) a datasetrelated to that topic State a research problem that can be answered by thatdataset and write a 10-step study plan that will allow you to analyze your data
Trang 3218 Statistics for Nursing and Allied Health
CRITICAL THINKING CONCEPT REVIEW
1 A peer-reviewed article may not explicitly state the different steps (e.g., lem statement, hypotheses) However, you should be able to identify each ofthese items through a careful reading of the article
prob-2 Any study that affected either clinical practice or health policy is acceptable
3 The important thing in this exercise is to follow the 10-step plan
BibliographyAhlqvist, M., Bogren, A., Hagman, S., et al (2006) Handline of peripheral intravenous cannu-
lae: effects of evidence-based clinical guidelines Journal of Clinical Nursing, 15,
1354–1361.
Aiken, L., Clarke, S., Cheung, R., Sloane, D., & Silber, J (2003) Educational levels of hospital
nurses and surgical patient mortality Journal of the American Medical Association, 290,
1617–1623.
Ajzen, I (1991) Theory of planned behavior Organizational Behavior and Human Decision Processes, 50, 179–211.
Andersen, R M (1995) Revisiting the behavioral model and access to medical care: Does it
matter? Journal of Health and Social Behavior, 36 (1), 1–10.
Andrews, E J., & Redmond, H P (2004) A review of clinical guidelines British Journal of Surgery, 91 (8), 956–64.
Armitage, C J (2005) Can the theory of planned behavior predict the maintenance of
physi-cal activity? Health Psychology, 24 (3), 235–245.
Auerbach, D., Buerhaus, P I., & Staiger, D O (2007) Trends: better late than never:
work-force supply implications of later entry into nursing Health Affairs, 26 (1), 178–186 Babbie, E (2007) The practice of social research Belmont, CA: Wadsworth.
Brodeur, P (1985) Outrageous misconduct: The asbestos industry on trial New York:
Pantheon.
Brooks, W (2004) The use of practice guidelines for urinary incontinence following stroke.
British Journal of Nursing, 13 (2), 1176–1179.
Brush, B L., Sochalski, J., & Berger, A M (2004) Imported care: recruiting foreign nurses to U.S health care facilities: importing nurses is likely to remain a viable and lucrative strat-
egy for plugging holes in the U.S nurse work-force Health Affairs, 23 (3), 78–87.
Cochrane Collaboration (2007) (http://www.cochrane.org/docs/descrip.htm).
Cook, T D., & Campbell, D T (1979) Quasi-experimentation: Design & analysis issues for field settings Boston, MA: Houghton Miffin Company.
Gift, A., Stommel, M., Jablonski, A., & Given, W (2003) A cluster of symptoms over time in
patients with lung cancer Nursing Research, 52 (6), 393–400.
Hagen, K B., Jamtvedt, G., Hilde, G., & Winnem, M F (2005) The updated Cochrane
Review of bed rest for low back pain and sciatica Spine, 30 (5), 542–546.
Chapter
Trang 33Huth, M M., & Broome, M E (2007) A snapshot of children’s postoperative tonsillectomy
outcomes at home Journal of Surgical and Postoperative Nursing, 12 (3), 186–195.
Klardie, K A., Johnson, J., McNaughton, M A., & Meyers, W (2004) Integrating the
princi-ples of evidence-based practice into clinical practice Journal of the American Academy of
Nurse-Practitioners, 16 (3), 98–105.
Lenz, E R., Pugh, L C., Milligan, R A., Gift, A., & Suppe, F (1997) The middle-range
the-ory of unpleasant symptoms: an update Advances in Nursing Science, 19 (3), 14–27 Liu, H E (2006) Fatigue and associated factors in hemodialysis patients in Taiwan Research
in Nursing and Health, 29, 40–50.
Ludwig-Beymer, P., & Gerc, S C (2002) An influenza prevention campaign: the employee
perspective Journal of Nursing Care Quality, 16 (3), 1–12.
McCormack, B (2003) Focus Knowing and acting: a strategic practitioner-focused approach
to nursing research and practice development NT Research, 8 (2), 86–100.
McNaughton, M A., Klardie, K., Meyers, W., & Johnson, J (2004) Integrating the principals
of evidence-based practice: testing and diagnosis Journal of the American Academy of
Nurse-Practitioners, 16 (1), 2–7.
Meyers, W C., Johnson, J A., Klardie, K., & McNaughton, M A (2004) Integrating the
prin-ciples of evidence-based practice: prognosis and the metabolic syndrome Journal of the
American Academy of Nurse Practitioners, 16 (5), 178–184.
Ogden, T E., & Goldberg, I A (2002) Research proposals: A guide to success (3rd ed) San Diego: Academic Press.
Penney, G., & Foy, R (2007) Do clinical guidelines enhance safe practice in obstetrics and
gynaecology? Best Practice & Research Clinical Obstetrics and Gynaecology, 21 (4),
657–673.
Polit, D F., & Beck, C T (2008) Nursing research: Generating and assessing evidence for
nursing practice (8thed) Philadelphia: Lippincott Williams & Wilkins.
Rychnovsky, J D (2007) Postpartum fatigue in the active duty military woman Journal of
Obstetric, Gynecologic, and Neonatal Nursing, 36 (1), 38–46.
Salsburg, D (2001) The lady tasting tea: How statistics revolutionized science in the twentieth
century New York: W.H Freeman.
Stevens, S S (2001) Systematic reviews: the heart of evidence-based practice AACN Clinical
Issues, 12 (4), 529–538.
U.S Bureau of the Census (2000) Statistical abstract of the United States: 2000 (120thed.).
Washington, D.C (http://factfinder.census.gov).
U.S Preventive Services Task Force (2007) (http://www.ahrq.gov/clinic/uspstfix.htm).
Wood, M J (2006) Basic steps in planning nursing research: From question to proposal
(6 th ed.) Boston: Jones & Bartlett Publishers.
Trang 34After studying this chapter, you should be able to:
1 Correctly identify the measurement scale of a variable
2 Recognize and define various symbols used in statistics
3 Identify the three main ways of organizing and presenting data
4 Construct a frequency table
5 Graphically display data, using bar charts, histograms, stem-and-leaf plots, frequencypolygons, and cumulative frequency polygons
6 Understand the use of percentiles and how to obtain them from an ogive
7 Describe variables, using measures of central tendency, dispersion, shape, and skewness
8 Obtain measures of central tendency, dispersion, shape, and skewness in SPSS
9 Correctly interpret SPSS printouts of measures of central tendency, dispersion, shape, andskewness
The stem-and-leaf plot was invented by John Wilder Tukey (1915–2000) in the 1960s as a way to present the distribution of a variable Tukey also coined the terms “bit” (for binary digit) and “software” (for computer programs)
Trang 35(Salsburg, 2001) A prolific writer and versatile mathematician, Tukey was an only child who was homeschooled; his formal education did not begin until he entered Brown University to study mathematics and chemistry After receiv- ing his doctorate, Tukey become an instructor at Princeton University He left this position during World War II to join the Fire Control Research Office After the war, Tukey worked with AT&T Bell Laboratories while maintaining his position at Princeton He authored hundreds of papers during his career and pioneered many techniques in exploratory data analysis, time-series analysis, and multiple comparisons (Cleveland, 1988).
LANGUAGE OF STATISTICS
Statistics is the field of applied mathematics that is concerned with the collection,analysis, and interpretation of data Statistics, as a field, uses its own language,including special symbols that represent formulas and other mathematicalexpressions and specialized terms that describe different variables This chapterintroduces and defines some of these symbols and specialized terms Some of themore commonly used symbols can be found in Table 2-1
What Is a Variable?
When doing a research project, data are typically collected from a sample lation and then analyzed to answer research questions that are directly related tothe collected data Variables measure the different characteristics or parameters
popu-of a given sample A variable is any characteristic that can and does assume ferent values from person to person in a population or sample of interest Forexample, demographic variables describe basic characteristics of human popula-tions, such as gender, age, ethnicity, marital status, number of children, educationlevel, employment status, and income To be considered a “variable,” however, it
dif-is critical that everyone in a given sample does not have the same value for thdif-isvariable For example, gender, which can assume two values (male and female), isnot a variable when studying a population of pregnant women Because everyone
in such a study is female, gender does not vary and thus cannot be considered a
“variable” in the study
The values of the variables are manipulated in different ways to compute ferent statistics The statistics are represented by mathematical equations, whichare written using mathematical symbols These notations can be difficult to learn,but they are essential when writing formulas Mathematical symbols are usedthroughout this text Table 2-1 provides an overview of some of the symbols andmathematical notations commonly used in statistics
Trang 36dif-22 Statistics for Nursing and Allied Health
TABLE 2-1 MATHEMATICAL SYMBOLS IN STATISTICS
2 Sum of the squared x’s; square each value of the variable x and then add up
all the squared values (xi ) 2 Sum of the x’s, squared; add up all the values of the variable x and then
square the total
x The absolute value of x
p (A) Probability of event A happening (marginal probability)
p (A B) Probability of event A happening if B happens (conditional probability)
p (A B) Probability of both event A and event B happening (intersection of A and B)
p (A B) Probability of event A happening or event B happening (union of A and B)
Statistical Symbols
Alpha: the significance level set for the study
p The p-value of the computed statistic
H0 The null hypothesis
H A The alternative hypothesis
Type I error in hypothesis testing error Type II error in hypothesis testing
N Population size
n Sample size
f Frequency
p i , p 95 Percentile rank at the ith percentile, 95th percentile
Mu, the population mean
x– x-bar, the sample mean
2 Sigma squared, the population variance
Sigma, the population standard deviation
Population correlation coefficient
r Sample correlation coefficient
Trang 37Measurement Scales
Measurement is defined as the process of assigning numerical values to differentcharacteristics of the observations so that they can be differentiated from oneanother (Vogt, 2005) In data analysis, four basic types of measurement are used:categorization, rank ordering, interval ordering, and numerical scoring Each ofthese types of measurement corresponds to one of four measurement scales:nominal (categorization), ordinal (rank ordering), interval (interval ordering), orratio (numerical scoring) (Babbie, 2007; Polit and Beck, 2008)
vari-“dichotomous” variables
Ordinal Variables
Ordinal scale variables can be placed in a meaningful numerical order (e.g., fromlowest to highest), but there is no information about the specific size of the inter-val between the different values, and there is no “true zero.” Numbers are simplyused to put observations in rank order For example, it is possible to rank militarypersonnel from lowest to highest (e.g., private, corporal, sergeant, lieutenant),but it is impossible to say anything meaningful about how much greater a corpo-ral is than a private It is also impossible to say if the interval between a privateand a corporal is the same as the interval between a sergeant and a lieutenant Anordinal variable can have the number zero as one of its possible values, but it willstill not be a “true zero.” For example, the value 0 can be assigned to the rank ofprivate, 1 to the rank of corporal, and so on However, the number 10 could just
as easily be assigned to the rank of private and the number 20 to the rank of poral without changing the meaning of the variable as long as the higher ranks areassigned higher numbers
cor-All subjective rating scales are considered ordinal, including satisfactionscales, ratings of pain or discomfort, measures of psychological states such asdepression, opinion scales, and Likert scales and scale items Likert scales typi-cally use the following format: “Please rate your satisfaction or opinion on a scale
Trang 38of 1 to 5, where 1 is strongly disagree and 5 is strongly agree.” They may haveother response categories (e.g., strongly like, strongly dislike) as well (Likert,Roslow, and Murphy, 1934) Likert scales were originally developed by RensisLikert as part of his doctoral dissertation at Columbia University The scales wereconcerned with measuring psychological attitudes in a quantifiable way TodayLikert scales are commonly used in many studies of health care quality and out-comes.
Interval Variables
Interval scale variables can be placed in a meaningful numerical order and havemeaningful intervals between values because the units of measure are in equalintervals However, interval scales do not have a “true zero,” and ratios of scorescan not be meaningfully calculated Fahrenheit temperature is a good example of
an interval scale because it has no true zero Zero degrees Fahrenheit is not anabsolute zero (as it is in the Kelvin scale of temperature), and 80°F is not twice ashot as 40°F Note that few scales used by health care researchers, other than IQscales and some psychological measurements, are interval scales
Ratio Variables
Ratio scale variables can be placed in meaningful numerical order, have ingful intervals, and have a “true zero.” Most biomedical variables (e.g., weight,height, blood pressure, pulse rate) are ratio variables They all have a true zeroinsofar as the measurement needs to be anchored at zero to be accurate Forexample, if a patient admits to weighing 158 lb, it is assumed that the patient’sscale started at 0 and not at 5 lb Also, ratios of the values are meaningful for thislevel of data A pulse rate of 140 bpm is twice as fast as a pulse rate of 70 bpm.Other examples of ratio variables include age, income, and number of children
mean-ORGANIZING AND DISPLAYING DATA
The first step in any data analysis is to understand the distribution of the values ofthe variables Specifically, it is important to know four things about the measures
of each variable: its central tendency, dispersion, shape, and outliers Measures of
central tendency are the values that best represent the middle of a distribution;
they provide information about the values that are most typical The mean,
median, and mode are all measures of central tendency The measures of sion describe the extent to which the values of the variable are spread out around
disper-the measure of central tendency The standard deviation, interquartile range, and
24 Statistics for Nursing and Allied Health
Trang 39range are all measures of dispersion The shape of the distribution describes how
the values of the variable are distributed (symmetrically or asymmetrically)around the measure of central tendency This is typically determined visually
using graphical methods such as histograms or stem-and-leaf plots Outliers are
values that do not fit the pattern of the rest of the values of the variable; they areeither much larger or much smaller than the rest of the values, and they stand outbecause they are unusual The three common ways of presenting and organizing
data to describe its distribution are descriptive statistics, frequency distributions, and graphical displays.
Questions Answered by Descriptive Statistics
Descriptive statistics are used to describe and summarize data to make themmore meaningful These include measures of central tendency, dispersion, shape,and skewness Descriptive statistics are often presented visually using frequencydistributions and graphical displays such as histograms and bar charts For exam-ple, descriptive statistics were used to examine the rates of chronic conditions inpatients by race, ethnicity, and age in an attempt to answer the question: Arethere ethnic disparities in the rates of chronic conditions among elderly individu-als? This was at the core of a report about the health of older Americans by theCenter on an Aging Society at Georgetown University (Center on an AgingSociety, 2004) Because the increasingly diverse society in the United State pro-vides challenges to health care professionals to provide culturally competent care,
it is important that these providers understand how membership in different tural groups (e.g., ethnic, religious) can affect health For example, in this study,the rates of chronic conditions were found to be higher among older AfricanAmericans (77%) and Latinos (68%) than among whites (64%) or AsianAmericans (42%) In addition, this report found that racial and ethnic minoritieswere less likely to have a primary care physician and health insurance These dataabout elderly Americans can help to provide guidance to health sciencesresearchers, policymakers, and educators who seek to address the disparities inaccess to health care in this country
cul-The Pittsburgh Healthy Heart Project followed a group of asymptomatic,community-dwelling adults for 3 years to examine the progression of subclinicalcardiovascular disease (Stewart, Janicki, and Kamarck, 2006) The researchersused descriptive statistics to organize data from participants in an attempt toanswer the question: How does cardiovascular disease progress in an asympto-matic population (Stewart, Janicki, and Kamarck, 2006)? In particular, theauthors used descriptive statistics to describe the population at baseline and again
at follow-up At baseline, about half (50.9%) of the participants were women, and13% were nonwhite; the average age was 60.1 years (standard deviation, 4.6years) Information about baseline risk factors was also provided: at baseline,
Trang 409.7% of the participants smoked, the average body mass index (BMI) was 27.5(standard deviation, 4.6), and the average systolic blood pressure was 121.8 mm
Hg (standard deviation, 9.6) Data from a 3-year follow-up were compared withthe original data so the researchers could understand the factors that cause theserisk factors to change over time
Overall, the use of descriptive statistics in these two studies allows readers tobetter understand the population on which the studies were focused Thedescriptive statistics also serve to inform readers about the health conditions stud-ied in each population
Frequency Distributions
Frequency distributions provide a way of organizing data using a table format.They allow readers to grasp the basic characteristics of the distribution of the vari-able quickly When the data are organized into a table format, it is much easier forreaders to glean information about the central tendency, dispersion, and outliers
of the variable of interest When the data are not organized, it is much harder to
do so For example, Table 2-2 presents the body weight (in pounds) of 39 diabeticwomen using data from the Behavioral Risk Factor Surveillance System Survey(BRFSS; Centers for Disease Control and Prevention, 2000) These data are rankordered but not otherwise organized From this table alone, it is not immediatelyobvious where the middle of the distribution lies, what the shape of the distribu-tion is, or how many of the values are outliers
A frequency distribution table shows the possible values of the variablegrouped into class intervals, the raw and relative frequencies of those intervals,and the cumulative frequency The frequency distribution shown in Table 2-3was created from the data in Table 2-2 Table 2-3 demonstrates how much easier
it is to glean information from the data using this format The typical values ofthe variable (body weight), its dispersion, and the outliers are all immediatelyobvious For example, the majority of the women weigh between 180 and 199 lb,but a fairly even number of women weigh either more or less than this In addi-tion, at least one outlier is obvious—the woman who weighs between 300 and
319 lb
Frequency tables are useful when describing ordinal, interval, or ratio data.However, only limited frequency tables can be created for nominal-level vari-ables If the variable of interest is nominal, such as marital status, a table thatshows the raw and relative frequency of each response category could be created.There would be no class intervals or cumulative frequencies Furthermore, theresponses would not be organized in any set way Table 2-4 is an example of a fre-quency table created to describe the marital status of the 39 study participantsfrom the BRFSS
26 Statistics for Nursing and Allied Health
...disper-the measure of central tendency The standard deviation, interquartile range, and< /i>
24 Statistics for Nursing and Allied Health< /small>
Trang... marital status of the 39 study participantsfrom the BRFSS26 Statistics for Nursing and Allied Health< /small>
... skewness Descriptive statistics are often presented visually using frequencydistributions and graphical displays such as histograms and bar charts For exam-ple, descriptive statistics were used