Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 6 |
Descriptor
Scoring | 11 |
Test Theory | 11 |
Test Validity | 11 |
Test Reliability | 10 |
Item Response Theory | 5 |
Measurement Techniques | 4 |
Test Construction | 4 |
Comparative Analysis | 3 |
Computer Assisted Testing | 3 |
Psychometrics | 3 |
Correlation | 2 |
More ▼ |
Source
Communique | 1 |
Educational Testing Service | 1 |
Journal of Educational… | 1 |
Journal on Educational… | 1 |
Measurement and Evaluation in… | 1 |
Online Submission | 1 |
Physical Review Physics… | 1 |
Author
Publication Type
Reports - Research | 6 |
Journal Articles | 5 |
Books | 2 |
Reports - Descriptive | 2 |
Collected Works - General | 1 |
Guides - Classroom - Learner | 1 |
Information Analyses | 1 |
Reports - Evaluative | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Higher Education | 2 |
Postsecondary Education | 2 |
Audience
Researchers | 1 |
Students | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Test of English as a Foreign… | 1 |
Thematic Apperception Test | 1 |
What Works Clearinghouse Rating
Gayle Geschwind; Michael Vignal; Marcos D. Caballero; H.? J. Lewandowski – Physical Review Physics Education Research, 2024
The Survey of Physics Reasoning on Uncertainty Concepts in Experiments (SPRUCE) was designed to measure students' proficiency with measurement uncertainty concepts and practices across ten different assessment objectives to help facilitate the improvement of laboratory instruction focused on this important topic. To ensure the reliability and…
Descriptors: Measurement, Ambiguity (Context), Scientific Concepts, Physics
Alqarni, Abdulelah Mohammed – Journal on Educational Psychology, 2019
This study compares the psychometric properties of reliability in Classical Test Theory (CTT), item information in Item Response Theory (IRT), and validation from the perspective of modern validity theory for the purpose of bringing attention to potential issues that might exist when testing organizations use both test theories in the same testing…
Descriptors: Test Theory, Item Response Theory, Test Construction, Scoring
Mitchell, Alison M.; Truckenmiller, Adrea; Petscher, Yaacov – Communique, 2015
As part of the Race to the Top initiative, the United States Department of Education made nearly 1 billion dollars available in State Educational Technology grants with the goal of ramping up school technology. One result of this effort is that states, districts, and schools across the country are using computerized assessments to measure their…
Descriptors: Computer Assisted Testing, Educational Technology, Testing, Efficiency
Badjadi, Nour El Imane – Online Submission, 2013
The current paper on writing assessment surveys the literature on the reliability and validity of essay tests. The paper aims to examine the two concepts in relationship with essay testing as well as to provide a snapshot of the current understandings of the reliability and validity of essay tests as drawn in recent research studies. Bearing in…
Descriptors: Essay Tests, Writing Evaluation, Test Validity, Test Reliability
Sussman, Joshua; Beaujean, A. Alexander; Worrell, Frank C.; Watson, Stevie – Measurement and Evaluation in Counseling and Development, 2013
Item response models (IRMs) were used to analyze Cross Racial Identity Scale (CRIS) scores. Rasch analysis scores were compared with classical test theory (CTT) scores. The partial credit model demonstrated a high goodness of fit and correlations between Rasch and CTT scores ranged from 0.91 to 0.99. CRIS scores are supported by both methods.…
Descriptors: Item Response Theory, Test Theory, Measures (Individuals), Racial Identification
Haberman, Shelby J. – Educational Testing Service, 2011
Alternative approaches are discussed for use of e-rater[R] to score the TOEFL iBT[R] Writing test. These approaches involve alternate criteria. In the 1st approach, the predicted variable is the expected rater score of the examinee's 2 essays. In the 2nd approach, the predicted variable is the expected rater score of 2 essay responses by the…
Descriptors: Writing Tests, Scoring, Essays, Language Tests

Jaradat, Derar; Sawaged, Sari – Journal of Educational Measurement, 1986
The impact of the Subset Selection Technique (SST) for multiple-choice items on certain properties of a test was compared with that of two other methods, the Number Right and the Correction for Guessing Formula. Results indicated that SST outperformed the other two, producing higher reliability and validity without favoring high risk takers.…
Descriptors: Foreign Countries, Grade 9, Guessing (Tests), Measurement Techniques
Crocker, Linda; Algina, James – 1986
This text was written to help the reader acquire a base of knowledge about classical psychometrics and to integrate new ideas into that framework of knowledge. The material is organized into five units: (1) introduction to measurement theory; (2) reliability; (3) validity; (4) item analysis in test development; and (5) test scoring and…
Descriptors: Item Analysis, Measurement Techniques, Psychometrics, Scoring
Thathong, Ngamnit; Kruawan, Preecha – 1985
The feasibility of a self-scoring flexilevel test was investigated in terms of practical effectiveness and criterion related validity. The test was administered to over 2,000 students studying secondary school mathematics in Thailand. The study was conducted, and the test administered, in two phases: test development and evaluation. The test was…
Descriptors: Adaptive Testing, Feasibility Studies, Foreign Countries, High Schools
Costantino, Giuseppe; And Others – 1985
The theoretical framework and cross-cultural validation of Tell-Me-A-Story (TEMAS), a projective test developed to measure personality development in ethnic minority children, is presented. The TEMAS test consists of 23 chromatic pictures which incorporate the following characteristics: (1) representation of antithetical concepts which the…
Descriptors: Black Students, Culture Fair Tests, Elementary Education, Hispanic Americans
Linn, Robert L., Ed. – 1993
This collection explores the theory and applications of educational testing. It is divided into sections on theory and general principles of educational measurement, administration of tests and scoring, and applications of testing. The following chapters present information on test theory and use: (1) "Current Perspectives and Future…
Descriptors: Ability, Achievement Tests, Admission Criteria, Cognitive Psychology