Publication Date
In 2025: 0
Since 2024: 0
Since 2021 (last 5 years): 3
Since 2016 (last 10 years): 3
Since 2006 (last 20 years): 10
Descriptor
Psychometrics: 26
Test Items: 26
Testing Problems: 26
Test Construction: 15
Computer Assisted Testing: 6
Difficulty Level: 6
Test Validity: 6
Item Analysis: 5
Item Response Theory: 5
Measurement Techniques: 5
Test Bias: 5
Author
Smith, Richard M.: 2
Wainer, Howard: 2
Andrada, Gilbert N.: 1
Burstein, Leigh: 1
Chen, Yunxiao: 1
Cui, Ying: 1
Diamond, Esther E.: 1
Drasgow, Fritz: 1
Eli, Jennifer A.: 1
Engell, Sebastian: 1
Frey, Andreas: 1
Education Level
Higher Education: 3
Elementary Secondary Education: 2
Postsecondary Education: 2
Audience
Practitioners: 1
Researchers: 1
Students: 1
Location
Colombia: 1
Germany: 1
Kentucky: 1
United Kingdom: 1
United States: 1
Laws, Policies, & Programs
Individuals with Disabilities…: 1
No Child Left Behind Act 2001: 1
Assessments and Surveys
Iowa Tests of Basic Skills: 1
New Jersey College Basic…: 1
SAT (College Admission Test): 1
Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022
In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…
Descriptors: Standardized Tests, Test Items, Test Validity, Scores
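The monitoring scheme in this entry is, at heart, sequential change detection on an item statistic. As an illustration only (the article's actual procedure is not reproduced here), a two-sided CUSUM over an item's per-administration proportion correct might look like this in Python; the data, target value, slack, and threshold are all hypothetical:

    import numpy as np

    def cusum_monitor(stats, target, slack=0.01, threshold=0.08):
        """Flag the first administration at which an item's running
        statistic (e.g., proportion correct) drifts from its target.

        stats     -- per-administration values for one item
        target    -- the item's calibrated reference value
        slack     -- drift tolerated before evidence accumulates
        threshold -- alarm level for the CUSUM statistics
        """
        s_hi = s_lo = 0.0
        for t, x in enumerate(stats):
            s_hi = max(0.0, s_hi + (x - target - slack))  # upward drift
            s_lo = max(0.0, s_lo + (target - x - slack))  # downward drift
            if s_hi > threshold or s_lo > threshold:
                return t  # index of the flagged administration
        return None

    # Simulated item: stable for 10 administrations, then an abrupt shift
    rng = np.random.default_rng(0)
    clean = rng.normal(0.60, 0.02, 10)   # proportion correct near 0.60
    leaked = rng.normal(0.75, 0.02, 5)   # e.g., item exposure
    print(cusum_monitor(np.concatenate([clean, leaked]), target=0.60))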
Kim, Sooyeon; Walker, Michael – ETS Research Report Series, 2021
In this investigation, we used real data to assess potential differential effects associated with taking a test in a test center (TC) versus testing at home using remote proctoring (RP). We used a pseudo-equivalent groups (PEG) approach to examine group equivalence at the item level and the total score level. If our assumption holds that the PEG…
Descriptors: Testing, Distance Education, Comparative Analysis, Test Items
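The pseudo-equivalent groups idea is to reweight each mode group toward a common covariate distribution before comparing item statistics. A minimal post-stratification sketch, assuming a single hypothetical stratifying covariate; the study's actual weighting variables and estimation details may differ:

    import numpy as np

    def pseudo_equivalent_p(responses, strata, reference):
        """Weighted proportion correct per item after aligning one mode
        group's covariate distribution to a common reference.

        responses -- (examinees, items) 0/1 matrix for one mode group
        strata    -- stratum label per examinee (hypothetical covariate)
        reference -- dict: stratum -> share in the common reference group
        """
        strata = np.asarray(strata)
        weights = np.empty(len(strata))
        for s, target_share in reference.items():
            mask = strata == s
            weights[mask] = target_share / mask.mean()  # post-stratification
        return np.average(responses, axis=0, weights=weights)

    # Hypothetical group: 0/1 responses to 3 items, two covariate bands
    tc = np.array([[1, 1, 0], [1, 0, 0], [1, 1, 1], [0, 1, 0]])
    p = pseudo_equivalent_p(tc, ["hi", "hi", "hi", "lo"],
                            {"hi": 0.5, "lo": 0.5})
    print(p)  # item p-values on a pseudo-equivalent footing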
Janssen, Gerriet – Language Testing, 2022
This article provides a single, common-case study of a test retrofit project at one Colombian university. It reports on how the test retrofit project was carried out and describes the different areas of language assessment literacy the project afforded local teacher stakeholders. This project was successful in that it modified the test constructs…
Descriptors: Language Tests, Placement Tests, Language Teachers, College Faculty
Knell, Janie L.; Wilhoite, Andrea P.; Fugate, Joshua Z.; González-Espada, Wilson J. – Electronic Journal of Science Education, 2015
Current science education reform efforts emphasize teaching K-12 science using hands-on, inquiry activities. For maximum learning and probability of implementation among inservice teachers, these strategies must be modeled in college science courses for preservice teachers. About a decade ago, Morehead State University revised their science…
Descriptors: Item Response Theory, Multiple Choice Tests, Test Construction, Psychometrics
Orrill, Chandra Hawley; Kim, Ok-Kyeong; Peters, Susan A.; Lischka, Alyson E.; Jong, Cindy; Sanchez, Wendy B.; Eli, Jennifer A. – Mathematics Teacher Education and Development, 2015
Developing and writing assessment items that measure teachers' knowledge is an intricate and complex undertaking. In this paper, we begin with an overview of what is known about measuring teacher knowledge. We then highlight the challenges inherent in creating assessment items that focus specifically on measuring teachers' specialised knowledge…
Descriptors: Specialization, Knowledge Base for Teaching, Educational Strategies, Testing Problems
Taskinen, Päivi H.; Steimel, Jochen; Gräfe, Linda; Engell, Sebastian; Frey, Andreas – Peabody Journal of Education, 2015
This study examined students' competencies in engineering education at the university level. First, we developed a competency model in one specific field of engineering: process dynamics and control. Then, the theoretical model was used as a frame to construct test items to measure students' competencies comprehensively. In the empirical…
Descriptors: Models, Engineering Education, Test Items, Outcome Measures
National Council on Measurement in Education, 2012
Testing and data integrity on statewide assessments is defined as the establishment of a comprehensive set of policies and procedures for: (1) the proper preparation of students; (2) the management and administration of the test(s) that will lead to accurate and appropriate reporting of assessment results; and (3) maintaining the security of…
Descriptors: State Programs, Integrity, Testing, Test Preparation
Robitzsch, Alexander; Rupp, Andre A. – Educational and Psychological Measurement, 2009
This article describes the results of a simulation study to investigate the impact of missing data on the detection of differential item functioning (DIF). Specifically, it investigates how four methods for dealing with missing data (listwise deletion, zero imputation, two-way imputation, response function imputation) interact with two methods of…
Descriptors: Test Bias, Simulation, Interaction, Effect Size
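Of the four missing-data treatments named, two-way imputation is the easiest to state compactly: a missing response is replaced by person mean + item mean - grand mean, computed from the observed entries, then thresholded for dichotomous data. A sketch with hypothetical data (the study's simulation design is not reproduced):

    import numpy as np

    def two_way_imputation(X):
        """Impute missing item responses with the two-way procedure:
        person effect + item effect - overall effect from the observed
        entries, thresholded at .5 for 0/1 data.

        X -- (persons, items) array with np.nan for missing responses
        """
        X = np.asarray(X, dtype=float)
        observed = ~np.isnan(X)
        person_mean = np.nanmean(X, axis=1, keepdims=True)
        item_mean = np.nanmean(X, axis=0, keepdims=True)
        grand_mean = np.nanmean(X)
        imputed = person_mean + item_mean - grand_mean
        return np.where(observed, X, (imputed >= 0.5).astype(float))

    # Hypothetical 4 persons x 3 items with two omitted responses
    X = np.array([[1, 1, np.nan],
                  [0, np.nan, 0],
                  [1, 1, 1],
                  [0, 1, 0]], dtype=float)
    print(two_way_imputation(X))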
Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009
In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…
Descriptors: Test Length, Simulation, Correlation, Research Methodology
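A simplified reading of such an index: for every item an examinee answers correctly, check whether the items the cognitive model treats as prerequisite were also answered correctly, then normalize the misfit count to the range -1.0 to 1.0. The sketch below follows that logic with a hypothetical hierarchy; it illustrates the idea rather than reproducing the article's exact definition:

    def hci(responses, prereq):
        """Hierarchy-based person-fit index for one examinee.

        responses -- dict item -> 0/1 observed response
        prereq    -- dict item -> set of items the cognitive model
                     treats as prerequisite to that item

        For every correctly answered item, each failed prerequisite
        counts as a misfit; 1 - 2 * misfits / comparisons maps all
        misfits to -1.0 and none to +1.0.
        """
        misfits = comparisons = 0
        for item, x in responses.items():
            if x == 1:
                for k in prereq.get(item, ()):
                    comparisons += 1
                    misfits += (responses[k] == 0)
        return 1 - 2 * misfits / comparisons if comparisons else None

    # Hypothetical linear hierarchy: item1 -> item2 -> item3
    prereq = {"item2": {"item1"}, "item3": {"item1", "item2"}}
    print(hci({"item1": 1, "item2": 1, "item3": 1}, prereq))  #  1.0
    print(hci({"item1": 0, "item2": 0, "item3": 1}, prereq))  # -1.0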
Diamond, Esther E. – 1981
As test standards and research literature in general indicate, definitions of test bias and item bias vary considerably, as do the results of existing methods of identifying biased items. The situation is further complicated by issues of content, context, construct, and criterion. In achievement tests, for example, content validity may impose…
Descriptors: Achievement Tests, Aptitude Tests, Psychometrics, Test Bias
Sarvela, Paul D.; Noonan, John V. – Educational Technology, 1988
Describes measurement problems associated with computer-based testing (CBT) programs when they are part of a computer-assisted instruction curriculum. Topics discussed include CBT standards; selection of item types; the contamination of items that arises from test design strategies; and the non-equivalence of comparison groups in item analyses. (8…
Descriptors: Computer Assisted Instruction, Computer Assisted Testing, Item Analysis, Psychometrics

Lord, Frederic M. – Educational and Psychological Measurement, 1971
A number of empirical studies are suggested to answer certain questions in connection with flexilevel tests. (MS)
Descriptors: Comparative Analysis, Difficulty Level, Guessing (Tests), Item Analysis
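Lord's flexilevel design is compact enough to state directly: items are ordered by difficulty, the examinee starts in the middle, a correct answer branches to the next harder unused item and an incorrect answer to the next easier one, for about half the pool. A sketch with a hypothetical answer model:

    from math import ceil

    def flexilevel(items, answer):
        """Administer a flexilevel test over items sorted easy -> hard.

        answer -- callable: item -> True if the examinee is correct

        Starts at the middle item; correct responses branch to the next
        harder unused item, incorrect ones to the next easier.
        """
        mid = len(items) // 2
        harder, easier = mid + 1, mid - 1
        administered, current = [], items[mid]
        for _ in range(ceil(len(items) / 2)):
            correct = answer(current)
            administered.append((current, correct))
            if correct and harder < len(items):
                current, harder = items[harder], harder + 1
            elif not correct and easier >= 0:
                current, easier = items[easier], easier - 1
            elif harder < len(items):      # fall back if one end is used up
                current, harder = items[harder], harder + 1
            elif easier >= 0:
                current, easier = items[easier], easier - 1
        return administered

    # Hypothetical examinee who answers items below difficulty 6 correctly
    print(flexilevel(list(range(9)), lambda item: item < 6))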

Plake, Barbara S.; And Others – Psychology of Women Quarterly, 1981
Investigated the Mathematics Problem Solving (MPS) and Mathematics Concepts (MC) subtests of the Iowa Tests of Basic Skills for content and psychometric item bias at grades three, six, and eight. Identified items which favored either males or females. Found no skill classification, item content, or location trends. (Author/JAC)
Descriptors: Elementary Education, Elementary School Students, Mathematics Achievement, Psychometrics
Wiliam, Dylan – Review of Research in Education, 2010
The idea that validity should be considered a property of inferences, rather than of assessments, has developed slowly over the past century. In early writings about the validity of educational assessments, validity was defined as a property of an assessment. The most common definition was that an assessment was valid to the extent that it…
Descriptors: Educational Assessment, Validity, Inferences, Construct Validity

Wainer, Howard; Lewis, Charles – Journal of Educational Measurement, 1990
Three different applications of the testlet concept are presented, and the psychometric models most suitable for each application are described. Difficulties that testlets can help overcome include (1) context effects; (2) item ordering; and (3) content balancing. Implications for test construction are discussed. (SLD)
Descriptors: Algorithms, Computer Assisted Testing, Elementary Secondary Education, Item Response Theory
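One simple way to act on the testlet idea, shown here as an assumption-laden sketch rather than the authors' models, is to collapse each testlet into a single polytomous score (number correct), which absorbs the local dependence among items that share a passage or context:

    import numpy as np

    def testlet_scores(responses, testlets):
        """Collapse item-level 0/1 responses into polytomous testlet scores.

        responses -- (examinees, items) 0/1 matrix
        testlets  -- dict: testlet id -> list of item column indices

        The resulting number-correct columns can then be calibrated with
        a polytomous IRT model instead of treating the within-testlet
        items as conditionally independent.
        """
        responses = np.asarray(responses)
        return {name: responses[:, cols].sum(axis=1)
                for name, cols in testlets.items()}

    # Hypothetical reading test: items 0-2 on passage A, items 3-4 on B
    R = np.array([[1, 1, 0, 1, 0],
                  [0, 1, 1, 1, 1]])
    print(testlet_scores(R, {"passageA": [0, 1, 2], "passageB": [3, 4]}))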