Janssen, Gerriet – Language Testing, 2022
This article provides a single, common-case study of a test retrofit project at one Colombian university. It reports on how the test retrofit project was carried out and describes the different areas of language assessment literacy the project afforded local teacher stakeholders. This project was successful in that it modified the test constructs…
Descriptors: Language Tests, Placement Tests, Language Teachers, College Faculty
Leventhal, Brian C.; Grabovsky, Irina – Educational Measurement: Issues and Practice, 2020
Standard setting is arguably one of the most subjective techniques in test development and psychometrics. The decisions when scores are compared to standards, however, are arguably the most consequential outcomes of testing. Providing licensure to practice in a profession has high-stakes consequences for the public. Denying graduation or forcing…
Descriptors: Standard Setting (Scoring), Weighted Scores, Test Construction, Psychometrics
Norfolk, Philip A.; Farmer, Ryan L.; Floyd, Randy G.; Woods, Isaac L.; Hawkins, Haley K.; Irby, Sarah M. – Journal of Psychoeducational Assessment, 2015
The representativeness, recency, and size of norm samples strongly influence the accuracy of inferences drawn from their scores. Inadequate norm samples may lead to inflated or deflated scores for individuals and poorer prediction of developmental and academic outcomes. The purpose of this study was to apply Kranzler and Floyd's method for…
Descriptors: Intelligence Tests, Psychometrics, Sample Size, Norm Referenced Tests
Knell, Janie L.; Wilhoite, Andrea P.; Fugate, Joshua Z.; González-Espada, Wilson J. – Electronic Journal of Science Education, 2015
Current science education reform efforts emphasize teaching K-12 science using hands-on, inquiry activities. For maximum learning and probability of implementation among inservice teachers, these strategies must be modeled in college science courses for preservice teachers. About a decade ago, Morehead State University revised their science…
Descriptors: Item Response Theory, Multiple Choice Tests, Test Construction, Psychometrics
Orrill, Chandra Hawley; Kim, Ok-Kyeong; Peters, Susan A.; Lischka, Alyson E.; Jong, Cindy; Sanchez, Wendy B.; Eli, Jennifer A. – Mathematics Teacher Education and Development, 2015
Developing and writing assessment items that measure teachers' knowledge is an intricate and complex undertaking. In this paper, we begin with an overview of what is known about measuring teacher knowledge. We then highlight the challenges inherent in creating assessment items that focus specifically on measuring teachers' specialised knowledge…
Descriptors: Specialization, Knowledge Base for Teaching, Educational Strategies, Testing Problems
Taskinen, Päivi H.; Steimel, Jochen; Gräfe, Linda; Engell, Sebastian; Frey, Andreas – Peabody Journal of Education, 2015
This study examined students' competencies in engineering education at the university level. First, we developed a competency model in one specific field of engineering: process dynamics and control. Then, the theoretical model was used as a frame to construct test items to measure students' competencies comprehensively. In the empirical…
Descriptors: Models, Engineering Education, Test Items, Outcome Measures
Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010
"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…
Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques
von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010
The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…
Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria

Belmont, John M. – Intelligence, 1983
In an earlier article, Hunt envisions the automation of intelligence testing, but he appears to be overly optimistic. He neglects to mention conceptual and practical difficulties at the interface of measurement and theory that place psychometry not in the dawn of microcomputerization, but rather more nearly in its primordium. (Author)
Descriptors: Editorials, Intelligence, Intelligence Tests, Microcomputers

Calsyn, Donald A.; And Others – Journal of Consulting and Clinical Psychology, 1980
Total errors served as the criterion, and combined errors from the first four subtests served as the predictor. Correlation coefficients of .89 and .88 were obtained in validation and cross-validation phases. The first four subtests provide a suitable, stable estimate of the total score in this population. (Author)
Descriptors: Alcoholism, Neurological Organization, Patients, Predictive Validity
Lord, Frederic M. – 1970
Certain modifications of a conventional test are proposed which force the item difficulty level to adjust automatically to the ability level of the examinee. The modified test is called a flexilevel test. Although different examinees take different sets of items, the scoring method provides comparable scores for all. Furthermore, the test is…
Descriptors: Measurement Techniques, Models, Multiple Choice Tests, Psychometrics
Ekstrom, Ruth B. – 1979
Three areas of concern related to test bias and validity should be considered during the revision of the Standards for Educational and Psychological Tests. The first area concerns the sources and consequences of test bias. Five sources of bias have been identified: numerical bias, role bias, status bias, stereotypic bias, and familiarity bias. The…
Descriptors: Evaluation Criteria, Psychometrics, Test Bias, Test Construction
Merwin, Jack C. – 1979
The term, standardized achievement test, has evolved to imply a broad package of materials which includes a wide array of norms, lists of scoring services and educational objectives tested, and aids to interpretation. Test authors are convinced that individual differences in achievement do exist; that recognition of these differences will…
Descriptors: Achievement Tests, Elementary Secondary Education, Evaluation Needs, Measurement Objectives
DiBello, Lou; Stout, William – Measurement: Interdisciplinary Research and Perspectives, 2007
In this article, the authors provide their critique on a set of papers that investigated Mathematics Knowledge for Teachers (MKT) assessment and the underlying theory and characteristics of the validity enterprise. Three types of assumptions and inferences--elemental, structural, and ecological--are discussed in these papers. These assumptions…
Descriptors: Test Validity, Psychometrics, Test Construction, Evaluation Research
Ferrara, Steve – Measurement: Interdisciplinary Research and Perspectives, 2007
In this issue of Measurement: Interdisciplinary Research and Perspectives, Schilling et al. are explicit about the centrality of assessment design and development and psychometric analysis in validation. Schilling and colleagues, Kane (2004, 2006), other contemporary validity theorists and practitioners, and their predecessors typically discuss…
Descriptors: Test Validity, Psychometrics, Test Construction, Evaluation Research