Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 7 |
Since 2006 (last 20 years) | 13 |
Descriptor
Scaling | 22 |
Test Items | 22 |
Test Reliability | 22 |
Item Response Theory | 13 |
Test Construction | 11 |
Test Validity | 11 |
Psychometrics | 10 |
Scores | 10 |
Scoring | 7 |
Test Bias | 6 |
Item Analysis | 5 |
More ▼ |
Source
Author
Petscher, Yaacov | 2 |
Al Otaiba, Stephanie | 1 |
Algina, James | 1 |
Allen, Nancy L. | 1 |
Anderson, Daniel | 1 |
Barrow, Lloyd | 1 |
Bauduin, Charity | 1 |
Benderson, Albert, Ed. | 1 |
Boldt, R. F. | 1 |
Connor, Carol McDonald | 1 |
Daud, Muslem | 1 |
More ▼ |
Publication Type
Education Level
High Schools | 5 |
Middle Schools | 5 |
Secondary Education | 5 |
Early Childhood Education | 4 |
Elementary Education | 4 |
Junior High Schools | 4 |
Primary Education | 4 |
Grade 3 | 3 |
Grade 4 | 3 |
Grade 5 | 3 |
Grade 9 | 3 |
More ▼ |
Audience
Researchers | 2 |
Laws, Policies, & Programs
Individuals with Disabilities… | 1 |
Assessments and Surveys
ACT Assessment | 1 |
ACT Interest Inventory | 1 |
Kaufman Test of Educational… | 1 |
National Assessment of… | 1 |
North Carolina End of Course… | 1 |
Raven Progressive Matrices | 1 |
What Works Clearinghouse Rating
Myszkowski, Nils – Journal of Intelligence, 2020
Raven's Standard Progressive Matrices (Raven 1941) is a widely used 60-item long measure of general mental ability. It was recently suggested that, for situations where taking this test is too time consuming, a shorter version, comprised of only the last series of the Standard Progressive Matrices (Myszkowski and Storme 2018) could be used, while…
Descriptors: Intelligence Tests, Psychometrics, Nonparametric Statistics, Item Response Theory
Smith, William Zachary; Dickenson, Tammiee S.; Rogers, Bradley David – AERA Online Paper Repository, 2017
Questionnaire refinement and a process for selecting items for elimination are important tools for survey developers. One of the major obstacles in questionnaire refinement and elimination in surveys lies in one's ability to adequately and appropriately reconstruct a survey. Often times, surveys can be long and strenuous on the respondent,…
Descriptors: Surveys, Psychometrics, Test Construction, Test Reliability
Schoen, Robert C.; Anderson, Daniel; Riddell, Claire M.; Bauduin, Charity – Online Submission, 2018
This report provides a description of the development process, field testing, and psychometric properties of the fall 2015 grades 3-5 Elementary Mathematics Student Assessment (EMSA), a student mathematics test designed to be administered in a whole-group setting to students in grades 3, 4, and 5. The test was administered to 2,614 participating…
Descriptors: Elementary School Students, Elementary School Mathematics, Grade 3, Grade 4
Romine, William L.; Schaffer, Dane L.; Barrow, Lloyd – International Journal of Science Education, 2015
We describe the development and validation of a three-tiered diagnostic test of the water cycle (DTWC) and use it to evaluate the impact of prior learning experiences on undergraduates' misconceptions. While most approaches to instrument validation take a positivist perspective using singular criteria such as reliability and fit with a measurement…
Descriptors: Undergraduate Students, Diagnostic Tests, Water, Item Response Theory
Donovan, Courtney; Green, Kathy E.; Seidel, Kent – Leadership and Research in Education, 2017
Core competencies essential for effective teaching were identified via a literature review and a review of standards for teacher education, and vetted by state groups with interests in teacher education. Survey items based on these competencies asked teacher candidates, graduates, and teacher education program faculty how well the program prepared…
Descriptors: Teacher Effectiveness, Item Response Theory, Item Analysis, Test Items
Improving Comprehension Assessment for Middle and High School Students: Challenges and Opportunities
Sabatini, John; Petscher, Yaacov; O'Reilly, Tenaha; Truckenmiller, Adrea – Grantee Submission, 2015
For decades, standardized reading comprehension tests have consisted of a series of passages and associated multiple-choice questions. Although widely used in and out of the classroom, there continues to be considerable disagreement regarding how or whether such tests have net value in the service of advancing educational progress in reading. This…
Descriptors: Middle School Students, High School Students, Reading Comprehension, Reading Tests
Topczewski, Anna Marie – ProQuest LLC, 2013
Developmental score scales represent the performance of students along a continuum, where as students learn more they move higher along that continuum. Unidimensional item response theory (UIRT) vertical scaling has become a commonly used method to create developmental score scales. Research has shown that UIRT vertical scaling methods can be…
Descriptors: Item Response Theory, Scaling, Scores, Student Development
New Meridian Corporation, 2020
The purpose of this report is to describe the technical qualities of the 2018-2019 operational administration of the English language arts/literacy (ELA/L) and mathematics summative assessments in grades 3 through 8 and high school. The ELA/L assessments focus on reading and comprehending a range of sufficiently complex texts independently and…
Descriptors: Language Arts, Literacy Education, Mathematics Education, Summative Evaluation
New Meridian Corporation, 2020
The purpose of this report is to describe the technical qualities of the 2018-2019 operational administration of the English language arts/literacy (ELA/L) and mathematics assessments in grades 3 through 8 and high school. New Meridian, in coordination with multiple states and vendors, developed an alternate form of the summative assessment to…
Descriptors: Language Arts, Literacy Education, Mathematics Education, Summative Evaluation
Kuo, Bor-Chen; Daud, Muslem; Yang, Chih-Wei – EURASIA Journal of Mathematics, Science & Technology Education, 2015
This paper describes a curriculum-based multidimensional computerized adaptive test that was developed for Indonesia junior high school Biology. In adherence to the Indonesian curriculum of different Biology dimensions, 300 items was constructed, and then tested to 2238 students. A multidimensional random coefficients multinomial logit model was…
Descriptors: Secondary School Science, Science Education, Science Tests, Computer Assisted Testing
Frame, Laura B.; Vidrine, Stephanie M.; Hinojosa, Ryan – Journal of Psychoeducational Assessment, 2016
The Kaufman Test of Educational Achievement, Third Edition (KTEA-3) is a revised and updated comprehensive academic achievement test (Kaufman & Kaufman, 2014). Authored by Drs. Alan and Nadeen Kaufman and published by Pearson, the KTEA-3 remains an individual achievement test normed for individuals of ages 4 through 25 years, or for those in…
Descriptors: Achievement Tests, Elementary Secondary Education, Test Validity, Test Reliability
Petscher, Yaacov; Connor, Carol McDonald; Al Otaiba, Stephanie – Assessment for Effective Intervention, 2012
This study investigated the psychometrics of the "Diagnostic Evaluation of Language Variation-Screening Test" (DELV-S) test using confirmatory factor analysis, item response theory, and differential item functioning (DIF). Responses from 1,764 students in kindergarten through second grade were used in the study, with results indicating…
Descriptors: Diagnostic Tests, Screening Tests, Language Variation, Psychometrics
ACT, Inc., 2013
This manual contains information about the American College Test (ACT) Plan® program. The principal focus of this manual is to document the Plan program's technical adequacy in light of its intended purposes. This manual supersedes the 2011 edition. The content of this manual responds to requirements of the testing industry as established in the…
Descriptors: College Entrance Examinations, Formative Evaluation, Evaluation Research, Test Bias
Boldt, R. F. – 1992
The Test of Spoken English (TSE) is an internationally administered instrument for assessing nonnative speakers' proficiency in speaking English. The research foundation of the TSE examination described in its manual refers to two sources of variation other than the achievement being measured: interrater reliability and internal consistency.…
Descriptors: Adults, Analysis of Variance, Interrater Reliability, Language Proficiency
Douglass, James B. – 1979
A general process for testing the feasibility of applying alternative mathematical or statistical models to the solution of a practical problem is presented and flowcharted. The system is used to develop a plan to compare models for test equating. The five alternative models to be considered for equating are: (1) anchor test equating using…
Descriptors: Equated Scores, Error of Measurement, Latent Trait Theory, Mathematical Models
Previous Page | Next Page »
Pages: 1 | 2