Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 8 |
Descriptor
Comparative Testing | 20 |
Psychometrics | 20 |
Test Reliability | 20 |
Test Validity | 11 |
Higher Education | 5 |
Multiple Choice Tests | 5 |
College Students | 4 |
Foreign Countries | 4 |
Test Construction | 4 |
Test Format | 4 |
Test Items | 4 |
More ▼ |
Source
Author
Andrada, Gilbert N. | 1 |
Baranowski, Tom | 1 |
Bauer, Daniel | 1 |
Bhola, Dennison S. | 1 |
Chang, Lei | 1 |
Downs, Karen M. | 1 |
Elosua, Paula | 1 |
Fischer, Martin R. | 1 |
Gilmore, Linda | 1 |
Green, Kathy E. | 1 |
Guttormsen, Sissel | 1 |
More ▼ |
Publication Type
Reports - Research | 18 |
Journal Articles | 10 |
Speeches/Meeting Papers | 7 |
Book/Product Reviews | 1 |
Reference Materials -… | 1 |
Reports - Evaluative | 1 |
Education Level
Higher Education | 3 |
Elementary Secondary Education | 2 |
High Schools | 2 |
Secondary Education | 2 |
Postsecondary Education | 1 |
Preschool Education | 1 |
Audience
Location
Canada | 2 |
Australia | 1 |
China | 1 |
Ireland | 1 |
Japan | 1 |
Maryland | 1 |
New Zealand | 1 |
Poland | 1 |
Portugal | 1 |
Singapore | 1 |
South Korea | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Tom Benton – Research Matters, 2024
Educational assessment is used throughout the world for a range of different formative and summative purposes. Wherever an assessment is developed, whether by a teacher creating a quiz for their class, or by a testing company creating a high stakes assessment, it is necessary to decide how long the test should be. Specifically, how many questions…
Descriptors: Foreign Countries, High Stakes Tests, Test Length, Test Construction
Tülin Otbiçer Acar – Measurement: Interdisciplinary Research and Perspectives, 2024
The aim of this study is to compare the results of correlation coefficient estimation of reliability with those obtained through the Bland-Altman plot technique. The scale was first divided into two halves using three different approaches. A linear and high-level relationship was found between the scale scores obtained from the halved forms.…
Descriptors: High School Students, Measurement Techniques, Psychometrics, Comparative Testing
Lahner, Felicitas-Maria; Lörwald, Andrea Carolin; Bauer, Daniel; Nouns, Zineb Miriam; Krebs, René; Guttormsen, Sissel; Fischer, Martin R.; Huwendiek, Sören – Advances in Health Sciences Education, 2018
Multiple true-false (MTF) items are a widely used supplement to the commonly used single-best answer (Type A) multiple choice format. However, an optimal scoring algorithm for MTF items has not yet been established, as existing studies yielded conflicting results. Therefore, this study analyzes two questions: What is the optimal scoring algorithm…
Descriptors: Scoring Formulas, Scoring Rubrics, Objective Tests, Multiple Choice Tests
Ward, Samantha L.; Sullivan, Karen A.; Gilmore, Linda – Educational and Developmental Psychologist, 2016
Objective: Limited time and resources necessitate the availability of accurate, inexpensive and rapid diagnostic aids for Autism Spectrum Disorder (ASD). The Autistic Behavioural Indicators Instrument (ABII) was developed for this purpose, but its psychometric properties have not yet been fully established. Method: The clinician-rated ABII, the…
Descriptors: Autism, Pervasive Developmental Disorders, Psychometrics, Diagnostic Tests
Slepkov, Aaron D.; Shiell, Ralph C. – Physical Review Special Topics - Physics Education Research, 2014
Constructed-response (CR) questions are a mainstay of introductory physics textbooks and exams. However, because of the time, cost, and scoring reliability constraints associated with this format, CR questions are being increasingly replaced by multiple-choice (MC) questions in formal exams. The integrated testlet (IT) is a recently developed…
Descriptors: Science Tests, Physics, Responses, Multiple Choice Tests
Lissitz, Robert W.; Hou, Xiaodong; Slater, Sharon Cadman – Journal of Applied Testing Technology, 2012
This article investigates several questions regarding the impact of different item formats on measurement characteristics. Constructed response (CR) items and multiple choice (MC) items obviously differ in their formats and in the resources needed to score them. As such, they have been the subject of considerable discussion regarding the impact of…
Descriptors: Computer Assisted Testing, Scoring, Evaluation Problems, Psychometrics
Elosua, Paula; Iliescu, Dragos – International Journal of Testing, 2012
Psychometric practice does not always converge with the advances of psychometric theory. In order to investigate this gap, the authors focus on the 10 most used psychological tests in Europe, as identified by recent surveys. The article analyzes test manuals published in 6 different European countries for these 10 most used tests. A total of 32…
Descriptors: Psychological Testing, Personality Measures, Error of Measurement, Foreign Countries
Kong, Xiaojing J.; Wise, Steven L.; Bhola, Dennison S. – Educational and Psychological Measurement, 2007
This study compared four methods for setting item response time thresholds to differentiate rapid-guessing behavior from solution behavior. Thresholds were either (a) common for all test items, (b) based on item surface features such as the amount of reading required, (c) based on visually inspecting response time frequency distributions, or (d)…
Descriptors: Test Items, Reaction Time, Timed Tests, Item Response Theory
Green, Kathy E. – 1989
The psychometric utility of six experimental cognitive style (CS) measures was analyzed. Examinees were 1,135 clients of the Johnson O'Connor Research Foundation who, during 1985, completed at least one of the six CS tests. Information is provided on measure reliability; relationships among CS measures; relationships with standard battery aptitude…
Descriptors: Age Differences, Aptitude Tests, Cognitive Measurement, Cognitive Style
Youngjohn, James R.; And Others – 1991
Test-retest reliabilities and practice effect magnitudes were considered for nine computer-simulated tasks of everyday cognition and five traditional neuropsychological tests. The nine simulated everyday memory tests were from the Memory Assessment Clinic battery as follows: (1) simple reaction time while driving; (2) divided attention (driving…
Descriptors: Adults, Comparative Testing, Computer Assisted Testing, Computer Simulation

Valencia, Richard R. – Psychology in the Schools, 1983
Examined the stability of the McCarthy Scales of Children's Abilities for a sample of 42 English-speaking and 42 Spanish-speaking Mexican-American preschoolers. The subjects were retested after one year. Concluded that the McCarthy is a relatively stable instrument for English-speaking Mexican-American children. (Author)
Descriptors: Cohort Analysis, Comparative Testing, Culture Fair Tests, Mexican Americans

Neto, Felix – Journal of Youth and Adolescence, 1993
The applicability of the Satisfaction With Life Scale (SWLS), developed in the United States, to another culture was assessed by investigating reliability and validity of the SWLS with 99 boys and 118 girls from Portugal. The cross-national validity of the scale and its utility with different age groups are supported. (SLD)
Descriptors: Adolescents, Age Differences, Attitude Measures, Comparative Testing
Patton, M. J.; And Others – 1992
The Supervisory Working Alliance Inventory (SWAI) developed by J. F. Efstation, M. J. Patton, and C. M. Kardash (1990) was further evaluated for its psychometric properties and relationships with the Personal Reactions Scale--Revised (PRS-R) developed by E. L. Holloway and B. E. Wampold (1984), the only other measure of the relationship in…
Descriptors: Comparative Testing, Counselor Training, Factor Structure, Higher Education
Baranowski, Tom; And Others – 1985
The test reliability of two tests of family functioning--the Family Environment Scale (FES) and the Family Adaptability and Cohesion Evaluation Scales (FACES-II)--was studied in 111 Anglo American, Black American, and Mexican American Families. The sample included children in grades three to six, as well as adults. The FES was administered to the…
Descriptors: Adults, Blacks, Children, Comparative Testing
Pfeiffer, Steven I.; And Others – 1992
A simple way of comparing the more widely used measures of the outcomes of mental health treatment is offered through this list of measures that includes both practical information and information about psychometric considerations. Each review contains information about the format, practicality (administration time and scoring), reliability,…
Descriptors: Behavior Rating Scales, Comparative Testing, Guides, Interviews
Previous Page | Next Page »
Pages: 1 | 2