Showing 796 to 810 of 1,166 results
Peer reviewed
Marsh, Herbert W.; And Others – Multivariate Behavioral Research, 1992
Results of a reanalysis of previously published data (B. M. Byrne, 1989) support the correlated uniqueness model, diagnostic tests of the validity of confirmatory factor analysis (CFA), multitrait multimethod (MTMM) solutions, inclusion of external validity in MTMM design, and application of factorial invariance to test stability of CFA-MTMM…
Descriptors: Academic Achievement, Construct Validity, Elementary Secondary Education, High Achievement
Peer reviewed
Banerji, Madhabi; Ferron, John – Educational and Psychological Measurement, 1998
Three analytic approaches were used in a framework of classical test theory to examine the construct validity of a mathematics assessment of 16 constructed response items. Results from 280 elementary school students across four age groups suggest a developmental structure of tasks and subdomains that was generally consistent with the test's…
Descriptors: Age Differences, Child Development, Construct Validity, Constructed Response
Peer reviewed
Wheeler, Patricia H. – Evaluation Practice, 1995
This volume is the fourth in a series for college faculty and advanced graduate students, "Survival Skills for Scholars." It offers practical advice for developing, using, and grading classroom examinations, focusing on traditional multiple-choice and constructed-response tests rather than alternative assessments. (SLD)
Descriptors: College Faculty, Constructed Response, Grading, Higher Education
Peer reviewed
Graham, James M. – Educational and Psychological Measurement, 2006
Coefficient alpha, the most commonly used estimate of internal consistency, is often considered a lower bound estimate of reliability, though the extent of its underestimation is not typically known. Many researchers are unaware that coefficient alpha is based on the essentially tau-equivalent measurement model. It is the violation of the…
Descriptors: Models, Test Theory, Reliability, Structural Equation Models
Peer reviewed
Fenna, Doug S. – European Journal of Engineering Education, 2004
Multiple-choice testing (MCT) has several advantages which are becoming more relevant in the current financial climate. In particular, they can be machine marked. As an objective testing method it is particularly relevant to engineering and other factual courses, but MCTs are not widely used in engineering because students can benefit from…
Descriptors: Guessing (Tests), Testing, Multiple Choice Tests, Engineering Education
Peer reviewed
Wilson, Mark; Allen, Diane D.; Li, Jun Corser – Health Education Research, 2006
This paper compares the approach and resultant outcomes of item response models (IRMs) and classical test theory (CTT). First, it reviews basic ideas of CTT, and compares them to the ideas about using IRMs introduced in an earlier paper. It then applies a comparison scheme based on the AERA/APA/NCME "Standards for Educational and…
Descriptors: Health Education, Self Efficacy, Health Behavior, Measures (Individuals)
Peer reviewed
Whitman, Glenn – History Teacher, 2003
In May 2001, students in the author's Advanced Placement (AP) United States History class were embroiled in a controversy surrounding the AP exam, in particular, having access to the exam's Document Based Question (DBQ) and free response portion prior to the test's administration. Prior to the exam, the College Board had provided a fifty-year time…
Descriptors: United States History, Standardized Tests, Advanced Placement Programs, Integrity
Peer reviewed
Boman, Peter; Curtis, David; Furlong, Michael J.; Smith, Douglas C. – Journal of Psychoeducational Assessment, 2006
The construct validity of the Australian version of the Multidimensional School Anger Inventory-Revised (MSAI-R) was examined using exploratory factor analysis (EFA), Rasch analysis, and confirmatory factor analysis (CFA) on a sample of 1,400 Australian students enrolled in Years 8 through 12. The EFA revealed a strong replication of the MSAI-R's…
Descriptors: Affective Measures, Psychological Patterns, Construct Validity, Reliability
Kennedy, Lauren Culzean – Online Submission, 2007
This research paper describes the benefits of using an activity-based rhetorical perspective to develop English for specific purposes (ESP) test specifications. This approach expands the potential of ESP test specifications to analyze and describe target language use (TLU) situations, TLU tasks, and ESP test tasks. Multiple activity systems are…
Descriptors: Freshman Composition, Tests, English for Academic Purposes, Rhetorical Theory
Peer reviewed
Wang, Tzu-Hua – Journal of Computer Assisted Learning, 2007
The web-based formative assessment developed in this research is named Formative Assessment Module of the Web-based Assessment and Test Analysis System (FAM-WATA). FAM-WATA is a multiple-choice web-based formative assessment module containing six effective strategies: 'repeat the test', 'correct answers are not given', 'query scores', 'ask…
Descriptors: Foreign Countries, Self Supporting Students, Student Attitudes, Internet
Zin, Than Than; Williams, John – 1991
Brief explanations are presented of some of the different methods used to score multiple-choice tests; and some studies of partial information, guessing strategies, and test-taking behaviors are reviewed. Studies are grouped in three categories of effort to improve scoring: (1) those that require extra effort from the examinee to answer…
Descriptors: Educational Research, Estimation (Mathematics), Guessing (Tests), Literature Reviews
Wheeler, Patricia; Haertel, Geneva D. – 1993
This handbook addresses the complex and expanding vocabulary of performance assessment and measurement by providing a glossary of related terms and lists of resources for the student, practitioner, and policymaker. Performance assessment includes all forms of such assessment from multiple choice and paper-and-pencil tests to alternative…
Descriptors: Alternative Assessment, Definitions, Educational Assessment, Glossaries
Ross, Steven; Hua, Te-Fang – 1994
A general issue related to language program development involves the empirical rationalization of cut score decisions in criterion-referenced language tests. Cut score dependability focuses on the consistency of the decisions in repeated testing or the assessment of language learner performances. In this case, the issue is to determine the optimal…
Descriptors: Achievement Gains, Criterion Referenced Tests, English (Second Language), Higher Education
Boldt, Robert F. – 1986
This study of the validity of the Graduate Record Examinations (GRE) General Test used data from predictive validity studies that were conducted by the GRE Validity Study Service (VSS) in 79 graduate departments. The performance criterion was first-year grades in graduate school. Observed validities were computed, and for each graduate department…
Descriptors: College Entrance Examinations, Departments, Grade Point Average, Graduate Study
Blair, R. Clifford; Higgins, James J. – 1985
Monte Carlo methods were employed to assess the relative power of the paired samples t test and Wilcoxon's signed-ranks test under ten population shapes. Results of the study indicated that: (1) each of the two statistics was more powerful than the other in given situations; (2) the power advantages of the t test under normal theory were small;…
Descriptors: Estimation (Mathematics), Literature Reviews, Measurement Techniques, Monte Carlo Methods