ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	13

Descriptor

Item Analysis	38
Test Reliability	38
Test Theory	38
Test Validity	18
Test Construction	16
Test Items	13
Career Development	10
Latent Trait Theory	9
Mathematical Models	8
Test Interpretation	8
Error of Measurement	7
Criterion Referenced Tests	6
Factor Analysis	6
Item Response Theory	6
Comparative Analysis	5
Difficulty Level	5
Foreign Countries	5
Item Sampling	5
Measurement Techniques	5
Testing	5
Achievement Tests	4
Correlation	4
Higher Education	4
Language Tests	4
Norm Referenced Tests	4

Source

Applied Psychological…	2
Advances in Health Sciences…	1
Assessment & Evaluation in…	1
Assessment for Effective…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
Journal of Educational…	1
Journal of Educational…	1
Journal of Educational and…	1
Journal of Intellectual &…	1
Journal on Educational…	1
Language Teaching Research…	1
Online Submission	1
ProQuest LLC	1
Psychometrika	1
SAGE Open	1
School Psychology Review	1

Publication Type

Reports - Research	22
Journal Articles	16
Reports - Descriptive	5
Reports - Evaluative	4
Speeches/Meeting Papers	3
Books	2
Dissertations/Theses -…	1
Guides - Classroom - Learner	1
Guides - Non-Classroom	1
Opinion Papers	1
Reference Materials -…	1

Education Level

Higher Education	5
Adult Education	2
Elementary Secondary Education	2
Postsecondary Education	2
Early Childhood Education	1
Elementary Education	1
Grade 1	1
Kindergarten	1
Preschool Education	1
Primary Education	1

Audience

Researchers	2
Practitioners	1
Students	1
Teachers	1

Location

Finland (Helsinki)	1
Singapore	1
South Africa	1
Spain	1
Texas	1

Laws, Policies, & Programs

Elementary and Secondary…

Assessments and Surveys

Armed Services Vocational…	1
Dyadic Adjustment Scale	1
Expressive One Word Picture…	1

Showing 1 to 15 of 38 results

The PSI-20: Development of a Viable Short Form Alternative of the Problem Solving Inventory Using Item Response Theory

Peer reviewed

Tyrone B. Pretorius; P. Paul Heppner; Anita Padmanabhanunni; Serena Ann Isaacs – SAGE Open, 2023

In previous studies, problem solving appraisal has been identified as playing a key role in promoting positive psychological well-being. The Problem Solving Inventory is the most widely used measure of problem solving appraisal and consists of 32 items. The length of the instrument, however, may limit its applicability to large-scale surveys…

Descriptors: Problem Solving, Measures (Individuals), Test Construction, Item Response Theory

Concurrent Validity of LLAMA_F: Measure of Language Analytic Ability as a Predictor of Morphosyntax Knowledge

Peer reviewed
PDF on ERIC

Kim, Peter – Language Teaching Research Quarterly, 2021

Foreign language aptitude is defined as one's potential to learn a second language. A language learner with higher aptitude is predicted to learn more, faster, and reach a higher level of proficiency. If this is the case, one way to validate the construct of aptitude and its measure is to conduct a validation study in which measures of aptitude is…

Descriptors: Morphology (Languages), Syntax, Second Language Learning, Second Language Instruction

Item Response Theory: An Introduction to Latent Trait Models to Test and Item Development

Peer reviewed
PDF on ERIC

Bichi, Ado Abdu; Talib, Rohaya – International Journal of Evaluation and Research in Education, 2018

Testing in an educational system performs a number of functions; the results from a test can be used to make a number of decisions in education. It is therefore well accepted in the education literature that testing is an important element of education. To effectively utilize the tests in educational policies and quality assurance its validity and…

Descriptors: Item Response Theory, Test Items, Test Construction, Decision Making

Screening Test Items for Differential Item Functioning

Peer reviewed

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014

A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…

Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing

The Number of Feedbacks Needed for Reliable Evaluation. A Multilevel Analysis of the Reliability, Stability and Generalisability of Students' Evaluation of Teaching

Peer reviewed

Rantanen, Pekka – Assessment & Evaluation in Higher Education, 2013

A multilevel analysis approach was used to analyse students' evaluation of teaching (SET). The low value of inter-rater reliability stresses that any solid conclusions on teaching cannot be made on the basis of single feedbacks. To assess a teacher's general teaching effectiveness, one needs to evaluate four randomly chosen course implementations.…

Descriptors: Test Reliability, Feedback (Response), Generalizability Theory, Student Evaluation of Teacher Performance

Application of the Rasch Rating Scale Model to the Assessment of Quality of Life of Persons with Intellectual Disability

Peer reviewed

Gomez, Laura E.; Arias, Benito; Verdugo, Miguel Angel; Navas, Patricia – Journal of Intellectual & Developmental Disability, 2012

Background: Most instruments that assess quality of life have been validated by means of the classical test theory (CTT). However, CTT limitations have resulted in the development of alternative models, such as the Rasch rating scale model (RSM). The main goal of this paper is testing and improving the psychometric properties of the INTEGRAL…

Descriptors: Evidence, Models, Mental Retardation, Quality of Life

Accessibility Theory for Enhancing the Validity of Test Results for Students with Special Needs

Peer reviewed

Beddow, Peter A. – International Journal of Disability, Development and Education, 2012

In the arena of educational testing, accessibility refers to the degree to which students are given the opportunity to participate in and engage a test. Accessibility theory is a model for examining the interactions between the test-taker and the test itself and defining how they may decrease some students' access to the test event, ultimately…

Descriptors: Test Results, Test Items, Educational Testing, Scores

Item-Level and Construct Evaluation of Early Numeracy Curriculum-Based Measures

Peer reviewed

Lee, Young-Sun; Lembke, Erica; Moore, Douglas; Ginsburg, Herbert P.; Pappas, Sandra – Assessment for Effective Intervention, 2012

The present study examined the technical adequacy of curriculum-based measures (CBMs) of early numeracy. Six 1-min early mathematics tasks were administered to 137 kindergarten and first-grade students, along with an omnibus test of early mathematics. The CBM measures included Count Out Loud, Quantity Discrimination, Number Identification, Missing…

Descriptors: Numeracy, Curriculum Based Assessment, Mathematics Tests, Kindergarten

Measurement Theory in Language Testing: Past Traditions and Current Trends

Peer reviewed
PDF on ERIC

Salmani-Nodoushan, Mohammad Ali – Journal on Educational Psychology, 2009

A good test is one that has at least three qualities: reliability, or the precision with which a test measures what it is supposed to measure; validity, i.e., if the test really measures what it is supposed to measure; and practicality, or if the test, no matter how sound theoretically, is practicable in reality. These are the sine qua non for any…

Descriptors: Generalizability Theory, Testing, Language Tests, Item Response Theory

Measurement Theory in Language Testing: Past Traditions and Current Trends

Salmani-Nodoushan, Mohammad Ali – Online Submission, 2009

A good test is one that has at least three qualities: reliability, or the precision with which a test measures what it is supposed to measure; validity, i.e., if the test really measures what it is supposed to measure; and practicality, or if the test, no matter how sound theoretically, is practicable in reality. These are the sine qua non for…

Descriptors: Generalizability Theory, Testing, Language Tests, Item Response Theory

Are Multiple Choice Tests Fair to Medical Students with Specific Learning Disabilities?

Peer reviewed

Ricketts, Chris; Brice, Julie; Coombes, Lee – Advances in Health Sciences Education, 2010

The purpose of multiple choice tests of medical knowledge is to estimate as accurately as possible a candidate's level of knowledge. However, concern is sometimes expressed that multiple choice tests may also discriminate in undesirable and irrelevant ways, such as between minority ethnic groups or by sex of candidates. There is little literature…

Descriptors: Medical Students, Testing Accommodations, Ethnic Groups, Learning Disabilities

Quantifying Emotional Intelligence in Relationships: The Validation of the Relationship Skills Map

Cox, Judith Ellen – ProQuest LLC, 2010

Emotional intelligence in relationships can be developed and enhanced through the use of an assessment instrument within a mentoring or counseling relationship. The Relationship Skills Map (RSM) has been created for this purpose. This study concerns the validation of the Relationship Skills Map. Participants in this study included members of a…

Descriptors: Stress Management, Graduate Students, Emotional Intelligence, Time Management

Tests in Europe: Where We Are and Where We Should Go

Peer reviewed

Elosua, Paula; Iliescu, Dragos – International Journal of Testing, 2012

Psychometric practice does not always converge with the advances of psychometric theory. In order to investigate this gap, the authors focus on the 10 most used psychological tests in Europe, as identified by recent surveys. The article analyzes test manuals published in 6 different European countries for these 10 most used tests. A total of 32…

Descriptors: Psychological Testing, Personality Measures, Error of Measurement, Foreign Countries

A Comparative Study of Indices for Internal Consistency.

Peer reviewed

Cudeck, Robert – Journal of Educational Measurement, 1980

Methods for evaluating the consistency of responses to test items were compared. When a researcher is unwilling to make the assumptions of classical test theory, has only a small number of items, or is in a tailored testing context, Cliff's dominance indices may be useful. (Author/CTM)

Descriptors: Error Patterns, Item Analysis, Test Items, Test Reliability

A Review of the Beta-Binomial Model and Its Extensions.

Peer reviewed

Wilcox, Rand R. – Journal of Educational Statistics, 1981

Both the binomial and beta-binomial models are applied to various problems occurring in mental test theory. The paper reviews and critiques these models. The emphasis is on the extensions of the models that have been proposed in recent years, and that might not be familiar to many educators. (Author)

Descriptors: Error of Measurement, Item Analysis, Mathematical Models, Test Reliability


Author

Haladyna, Tom	2
Salmani-Nodoushan, Mohammad…	2
Algina, James	1
Altepeter, Tom	1
Anita Padmanabhanunni	1
Arias, Benito	1
Bashaw, W. L.	1
Beddow, Peter A.	1
Bentler, P. M.	1
Bernknopf, Stanley	1
Bichi, Ado Abdu	1
Brice, Julie	1
Bullock, Lyndal M.	1
Chase, Clinton I.	1
Cliff, Norman	1
Cohen, Allan S., Comp.	1
Cook, Linda L.	1
Coombes, Lee	1
Cox, Judith Ellen	1
Crocker, Linda	1
Cudeck, Robert	1
Elosua, Paula	1
Epstein, Kenneth I.	1
Forster, Fred	1