Showing 1 to 15 of 21 results
Peer reviewed
Laprise, Shari L. – College Teaching, 2012
Successful exam composition can be a difficult task. Exams should not only assess student comprehension but also serve as learning tools in their own right. In a biotechnology course delivered to nonmajors at a business college, objective multiple-choice test questions often require students to choose the exception or "not true" choice. Anecdotal student…
Descriptors: Feedback (Response), Test Items, Multiple Choice Tests, Biotechnology
Peer reviewed
Stone, Gregory Ethan; Koskey, Kristin L. K.; Sondergeld, Toni A. – Educational and Psychological Measurement, 2011
Typical validation studies on standard setting models, most notably the Angoff and modified Angoff models, have ignored construct development, a critical aspect associated with all conceptualizations of measurement processes. Stone compared the Angoff and objective standard setting (OSS) models and found that Angoff failed to define a legitimate…
Descriptors: Cutting Scores, Standard Setting (Scoring), Models, Construct Validity
Peer reviewed
Hatcher, Donald L. – New Directions for Institutional Research, 2011
In this article, after describing an approach to teaching critical thinking (CT) that was in place at Baker University from 1990 to 2008, the author recounts the experience of assessing CT with three standardized exams and shows why the choice of a standardized CT test can be problematic and the results misleading. These results can be…
Descriptors: Test Results, Essay Tests, Critical Thinking, Thinking Skills
Peer reviewed
Ricketts, Chris; Brice, Julie; Coombes, Lee – Advances in Health Sciences Education, 2010
The purpose of multiple choice tests of medical knowledge is to estimate as accurately as possible a candidate's level of knowledge. However, concern is sometimes expressed that multiple choice tests may also discriminate in undesirable and irrelevant ways, such as between minority ethnic groups or by sex of candidates. There is little literature…
Descriptors: Medical Students, Testing Accommodations, Ethnic Groups, Learning Disabilities
Peer reviewed
Kim, Sooyeon; Walker, Michael E.; McHale, Frederick – Journal of Educational Measurement, 2010
In this study we examined variations of the nonequivalent groups equating design for tests containing both multiple-choice (MC) and constructed-response (CR) items to determine which design was most effective in producing equivalent scores across the two tests to be equated. Using data from a large-scale exam, this study investigated the use of…
Descriptors: Measures (Individuals), Scoring, Equated Scores, Test Bias
Peer reviewed
Wheeler, Sharon; Twist, Craig – European Physical Education Review, 2010
Body mass index (BMI) is increasingly recognized as an inadequate measure for determining obesity in children. Therefore, the aim of this study was to investigate other indirect methods of body fat assessment that could potentially be used in place of BMI. Twenty-four children (boys: 13.8 ± 0.8 yr; girls: 13.3 ± 0.5…
Descriptors: Obesity, Body Composition, Measurement Techniques, Comparative Testing
Peer reviewed
PDF full text on ERIC
Attali, Yigal; Bridgeman, Brent; Trapani, Catherine – Journal of Technology, Learning, and Assessment, 2010
A generic approach in automated essay scoring produces scores that have the same meaning across all prompts, existing or new, of a writing assessment. This is accomplished by using a single set of linguistic indicators (or features), a consistent way of combining and weighting these features into essay scores, and a focus on features that are not…
Descriptors: Writing Evaluation, Writing Tests, Scoring, Test Scoring Machines
Setzer, J. Carl; He, Yi – GED Testing Service, 2009
Reliability Analysis for the Internationally Administered 2002 Series GED (General Educational Development) Tests. Reliability refers to the consistency, or stability, of test scores when the measurement procedure is administered repeatedly to groups of examinees (American Educational Research Association [AERA], American Psychological…
Descriptors: Educational Research, Error of Measurement, Scores, Test Reliability
McGlynn, Angela Provitera – Education Digest: Essential Readings Condensed for Quick Review, 2008
A new report, "The Proficiency Illusion," released last year by the Thomas B. Fordham Institute states that the tests that states use to measure academic progress under the No Child Left Behind Act (NCLB) are creating a false impression of success, especially in reading and especially in the early grades. The report is a collaboration…
Descriptors: Federal Legislation, Academic Achievement, Rating Scales, Achievement Tests
Peer reviewed
Donnellan, M. Brent – Educational and Psychological Measurement, 2008
The properties of the achievement goal inventories developed by Grant and Dweck (2003) and Elliot and McGregor (2001) were evaluated in two studies with a total of 780 participants. A four-factor specification for the Grant and Dweck inventory did not closely replicate results published in their original report. In contrast, the structure of the…
Descriptors: Academic Achievement, Psychometrics, Program Validation, Achievement Rating
Peer reviewed
Kim, Do-Hong; Huynh, Huynh – Educational and Psychological Measurement, 2008
The current study compared student performance between paper-and-pencil testing (PPT) and computer-based testing (CBT) on a large-scale statewide end-of-course English examination. Analyses were conducted at both the item and test levels. The overall results suggest that scores obtained from PPT and CBT were comparable. However, at the content…
Descriptors: Reading Comprehension, Computer Assisted Testing, Factor Analysis, Comparative Testing
Peer reviewed
Ferdous, Abdullah A.; Plake, Barbara S. – Educational and Psychological Measurement, 2007
In an Angoff standard setting procedure, judges estimate the probability that a hypothetical randomly selected minimally competent candidate will answer correctly each item in the test. In many cases, these item performance estimates are made twice, with information shared with the panelists between estimates. Especially for long tests, this…
Descriptors: Test Items, Probability, Item Analysis, Standard Setting (Scoring)
Peer reviewed
Maguire, Phil; Devereux, Barry; Costello, Fintan; Cater, Arthur – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2007
The competition among relations in nominals (CARIN) theory of conceptual combination (C. L. Gagne & E. J. Shoben, 1997) proposes that people interpret nominal compounds by selecting a relation from a pool of competing alternatives and that relation availability is influenced by the frequency with which relations have been previously associated…
Descriptors: Competition, Program Validation, Item Analysis, Human Relations
Peer reviewed
van de Velden, Michel; Bijmolt, Tammo H. A. – Psychometrika, 2006
A method is presented for generalized canonical correlation analysis of two or more matrices with missing rows. The method is a combination of Carroll's (1968) method and the missing data approach of the OVERALS technique (Van der Burg, 1988). In a simulation study we assess the performance of the method and compare it to an existing procedure…
Descriptors: Multivariate Analysis, Matrices, Simulation, Comparative Testing
Peer reviewed
Meyer, J. Patrick; Huynh, Huynh; Seaman, Michael A. – Journal of Educational Measurement, 2004
Exact nonparametric procedures have been used to identify the level of differential item functioning (DIF) in binary items. This study explored the use of exact DIF procedures with items scored on a Likert scale. The results from an attitude survey suggest that the large-sample Cochran-Mantel-Haenszel (CMH) procedure identifies more items as…
Descriptors: Test Bias, Attitude Measures, Surveys, Predictive Validity