ERIC - Search Results

Showing 1 to 15 of 20 results

How Long Should a High Stakes Test Be?

Tom Benton – Research Matters, 2024

Educational assessment is used throughout the world for a range of different formative and summative purposes. Wherever an assessment is developed, whether by a teacher creating a quiz for their class, or by a testing company creating a high stakes assessment, it is necessary to decide how long the test should be. Specifically, how many questions…

Descriptors: Foreign Countries, High Stakes Tests, Test Length, Test Construction

Comparing Measurement Reliability Estimation Techniques: Correlation Coefficient vs. Bland-Altman Plot

Peer reviewed

Tülin Otbiçer Acar – Measurement: Interdisciplinary Research and Perspectives, 2024

The aim of this study is to compare the results of correlation coefficient estimation of reliability with those obtained through the Bland-Altman plot technique. The scale was first divided into two halves using three different approaches. A linear and high-level relationship was found between the scale scores obtained from the halved forms.…

Descriptors: High School Students, Measurement Techniques, Psychometrics, Comparative Testing

Multiple True-False Items: A Comparison of Scoring Algorithms

Peer reviewed

Lahner, Felicitas-Maria; Lörwald, Andrea Carolin; Bauer, Daniel; Nouns, Zineb Miriam; Krebs, René; Guttormsen, Sissel; Fischer, Martin R.; Huwendiek, Sören – Advances in Health Sciences Education, 2018

Multiple true-false (MTF) items are a widely used supplement to the commonly used single-best answer (Type A) multiple choice format. However, an optimal scoring algorithm for MTF items has not yet been established, as existing studies yielded conflicting results. Therefore, this study analyzes two questions: What is the optimal scoring algorithm…

Descriptors: Scoring Formulas, Scoring Rubrics, Objective Tests, Multiple Choice Tests

Agreement between a Brief Autism Observational Instrument and Established ASD Measures

Peer reviewed

Ward, Samantha L.; Sullivan, Karen A.; Gilmore, Linda – Educational and Developmental Psychologist, 2016

Objective: Limited time and resources necessitate the availability of accurate, inexpensive and rapid diagnostic aids for Autism Spectrum Disorder (ASD). The Autistic Behavioural Indicators Instrument (ABII) was developed for this purpose, but its psychometric properties have not yet been fully established. Method: The clinician-rated ABII, the…

Descriptors: Autism, Pervasive Developmental Disorders, Psychometrics, Diagnostic Tests

Comparison of Integrated Testlet and Constructed-Response Question Formats

Peer reviewed

Slepkov, Aaron D.; Shiell, Ralph C. – Physical Review Special Topics - Physics Education Research, 2014

Constructed-response (CR) questions are a mainstay of introductory physics textbooks and exams. However, because of the time, cost, and scoring reliability constraints associated with this format, CR questions are being increasingly replaced by multiple-choice (MC) questions in formal exams. The integrated testlet (IT) is a recently developed…

Descriptors: Science Tests, Physics, Responses, Multiple Choice Tests

The Contribution of Constructed Response Items to Large Scale Assessment: Measuring and Understanding Their Impact

Peer reviewed

Lissitz, Robert W.; Hou, Xiaodong; Slater, Sharon Cadman – Journal of Applied Testing Technology, 2012

This article investigates several questions regarding the impact of different item formats on measurement characteristics. Constructed response (CR) items and multiple choice (MC) items obviously differ in their formats and in the resources needed to score them. As such, they have been the subject of considerable discussion regarding the impact of…

Descriptors: Computer Assisted Testing, Scoring, Evaluation Problems, Psychometrics

Tests in Europe: Where We Are and Where We Should Go

Peer reviewed

Elosua, Paula; Iliescu, Dragos – International Journal of Testing, 2012

Psychometric practice does not always converge with the advances of psychometric theory. In order to investigate this gap, the authors focus on the 10 most used psychological tests in Europe, as identified by recent surveys. The article analyzes test manuals published in 6 different European countries for these 10 most used tests. A total of 32…

Descriptors: Psychological Testing, Personality Measures, Error of Measurement, Foreign Countries

Setting the Response Time Threshold Parameter to Differentiate Solution Behavior from Rapid-Guessing Behavior

Peer reviewed

Kong, Xiaojing J.; Wise, Steven L.; Bhola, Dennison S. – Educational and Psychological Measurement, 2007

This study compared four methods for setting item response time thresholds to differentiate rapid-guessing behavior from solution behavior. Thresholds were either (a) common for all test items, (b) based on item surface features such as the amount of reading required, (c) based on visually inspecting response time frequency distributions, or (d)…

Descriptors: Test Items, Reaction Time, Timed Tests, Item Response Theory

Analysis of Cognitive Style Measures.

Green, Kathy E. – 1989

The psychometric utility of six experimental cognitive style (CS) measures was analyzed. Examinees were 1,135 clients of the Johnson O'Connor Research Foundation who, during 1985, completed at least one of the six CS tests. Information is provided on measure reliability; relationships among CS measures; relationships with standard battery aptitude…

Descriptors: Age Differences, Aptitude Tests, Cognitive Measurement, Cognitive Style

Test-Retest Reliability of Computerized, Everyday Memory Measures and Traditional Memory Tests.

Youngjohn, James R.; And Others – 1991

Test-retest reliabilities and practice effect magnitudes were considered for nine computer-simulated tasks of everyday cognition and five traditional neuropsychological tests. The nine simulated everyday memory tests were from the Memory Assessment Clinic battery as follows: (1) simple reaction time while driving; (2) divided attention (driving…

Descriptors: Adults, Comparative Testing, Computer Assisted Testing, Computer Simulation

Stability of the McCarthy Scales of Children's Abilities over a One-Year Period for Mexican-American Children.

Peer reviewed

Valencia, Richard R. – Psychology in the Schools, 1983

Examined the stability of the McCarthy Scales of Children's Abilities for a sample of 42 English-speaking and 42 Spanish-speaking Mexican-American preschoolers. The subjects were retested after one year. Concluded that the McCarthy is a relatively stable instrument for English-speaking Mexican-American children. (Author)

Descriptors: Cohort Analysis, Comparative Testing, Culture Fair Tests, Mexican Americans

The Satisfaction with Life Scale: Psychometrics Properties in an Adolescent Sample.

Peer reviewed

Neto, Felix – Journal of Youth and Adolescence, 1993

The applicability of the Satisfaction With Life Scale (SWLS), developed in the United States, to another culture was assessed by investigating reliability and validity of the SWLS with 99 boys and 118 girls from Portugal. The cross-national validity of the scale and its utility with different age groups are supported. (SLD)

Descriptors: Adolescents, Age Differences, Attitude Measures, Comparative Testing

The Supervisory Working Alliance Inventory: A Validity Study.

Patton, M. J.; And Others – 1992

The Supervisory Working Alliance Inventory (SWAI) developed by J. F. Efstation, M. J. Patton, and C. M. Kardash (1990) was further evaluated for its psychometric properties and relationships with the Personal Reactions Scale--Revised (PRS-R) developed by E. L. Holloway and B. E. Wampold (1984), the only other measure of the relationship in…

Descriptors: Comparative Testing, Counselor Training, Factor Structure, Higher Education

Comparative Reliability of Two Measures of Family Functioning. Draft.

Baranowski, Tom; And Others – 1985

The test reliability of two tests of family functioning--the Family Environment Scale (FES) and the Family Adaptability and Cohesion Evaluation Scales (FACES-II)--was studied in 111 Anglo American, Black American, and Mexican American families. The sample included children in grades three to six, as well as adults. The FES was administered to the…

Descriptors: Adults, Blacks, Children, Comparative Testing

A Consumer's Guide to Mental Health Treatment Outcome Measures.

Pfeiffer, Steven I.; And Others – 1992

A simple way of comparing the more widely used measures of the outcomes of mental health treatment is offered through this list of measures that includes both practical information and information about psychometric considerations. Each review contains information about the format, practicality (administration time and scoring), reliability,…

Descriptors: Behavior Rating Scales, Comparative Testing, Guides, Interviews
