ERIC - Search Results

Showing 1 to 15 of 17 results

Automatic Wordnet Construction and Its Application in Generating Distractors for Cloze Questions

Yicheng Sun – ProQuest LLC, 2024

We study how to automatically generate cloze questions from given texts to assess reading comprehension, where a cloze question consists of a stem with a blank space holder for the answer key, and three distractors for generating confusions. We present a generative method called CQG (Cloze Question Generator) for constructing cloze questions from…

Descriptors: Cloze Procedure, Reading Processes, Questioning Techniques, Computational Linguistics

Evaluation of Polytomous Item Locations in Multicomponent Measuring Instruments: A Note on a Latent Variable Modeling Procedure

Peer reviewed

Raykov, Tenko; Pusic, Martin – Educational and Psychological Measurement, 2023

This note is concerned with evaluation of location parameters for polytomous items in multiple-component measuring instruments. A point and interval estimation procedure for these parameters is outlined that is developed within the framework of latent variable modeling. The method permits educational, behavioral, biomedical, and marketing…

Descriptors: Item Analysis, Measurement Techniques, Computer Software, Intervals

Investigating Concept Definition and Skill Modeling for Cognitive Diagnosis in Language Learning

Peer reviewed

Boxuan Ma; Sora Fukui; Yuji Ando; Shinichi Konomi – Journal of Educational Data Mining, 2024

Language proficiency diagnosis is essential to extract fine-grained information about the linguistic knowledge states and skill mastery levels of test takers based on their performance on language tests. Different from comprehensive standardized tests, many language learning apps often revolve around word-level questions. Therefore, knowledge…

Descriptors: Language Proficiency, Brain Hemisphere Functions, Language Processing, Task Analysis

An Introduction to the Analysis of Ranked Response Data

Peer reviewed

Finch, Holmes – Practical Assessment, Research & Evaluation, 2022

Researchers in many disciplines work with ranking data. This data type is unique in that it is often deterministic in nature (the ranks of items "k"-1 determine the rank of item "k"), and the difference in a pair of rank scores separated by "k" units is equivalent regardless of the actual values of the two ranks in…

Descriptors: Data Analysis, Statistical Inference, Models, College Faculty

A Cognitive Diagnostic Assessment Study of the Reading Comprehension Section of the Preliminary English Test (PET)

Peer reviewed

Mohammed, Aisha; Dawood, Abdul Kareem Shareef; Alghazali, Tawfeeq; Kadhim, Qasim Khlaif; Sabti, Ahmed Abdulateef; Sabit, Shaker Holh – International Journal of Language Testing, 2023

Cognitive diagnostic models (CDMs) have received much interest within the field of language testing over the last decade due to their great potential to provide diagnostic feedback to all stakeholders and ultimately improve language teaching and learning. A large number of studies have demonstrated the application of CDMs on advanced large-scale…

Descriptors: Reading Comprehension, Reading Tests, Language Tests, English (Second Language)

On Studying Common Factor Dominance and Approximate Unidimensionality in Multicomponent Measuring Instruments with Discrete Items

Peer reviewed

Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2018

This article outlines a procedure for examining the degree to which a common factor may be dominating additional factors in a multicomponent measuring instrument consisting of binary items. The procedure rests on an application of the latent variable modeling methodology and accounts for the discrete nature of the manifest indicators. The method…

Descriptors: Measurement Techniques, Factor Analysis, Item Response Theory, Likert Scales

Critical Language Assessment Literacy of EFL Teachers: Scale Construction and Validation

Peer reviewed

Tajeddin, Zia; Khatib, Mohammad; Mahdavi, Mohsen – Language Testing, 2022

Critical language assessment (CLA) has been addressed in numerous studies. However, the majority of the studies have overlooked the need for a practical framework to measure the CLA dimension of teachers' language assessment literacy (LAL). This gap prompted us to develop and validate a critical language assessment literacy (CLAL) scale to further…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Language Tests

Using Corpus Linguistics Tools to Identify Instances of Low Linguistic Accessibility in Tests

Beauchamp, David; Constantinou, Filio – Research Matters, 2020

Assessment is a useful process as it provides various stakeholders (e.g., teachers, parents, government, employers) with information about students' competence in a particular subject area. However, for the information generated by assessment to be useful, it needs to support valid inferences. One factor that can undermine the validity of…

Descriptors: Computational Linguistics, Inferences, Validity, Language Usage

A Review of Digital Formative Assessment Tools: Features and Future Directions

Peer reviewed

Çekiç, Ahmet; Bakla, Arif – International Online Journal of Education and Teaching, 2021

The Internet and the software stores for mobile devices come with a huge number of digital tools for any task, and those intended for digital formative assessment (DFA) have burgeoned exponentially in the last decade. These tools vary in terms of their functionality, pedagogical quality, cost, operating systems and so forth. Teachers and learners…

Descriptors: Formative Evaluation, Futures (of Society), Computer Assisted Testing, Guidance

A Note on Item-Restscore Association in Rasch Models

Peer reviewed

Kreiner, Svend – Applied Psychological Measurement, 2011

To rule out the need for a two-parameter item response theory (IRT) model during item analysis by Rasch models, it is important to check the Rasch model's assumption that all items have the same item discrimination. Biserial and polyserial correlation coefficients measuring the association between items and restscores are often used in an informal…

Descriptors: Item Analysis, Correlation, Item Response Theory, Models

Investigation of IRT-Based Equating Methods in the Presence of Outlier Common Items

Peer reviewed

Hu, Huiqin; Rogers, W. Todd; Vukmirovic, Zarko – Applied Psychological Measurement, 2008

Common items with inconsistent b-parameter estimates may have a serious impact on item response theory (IRT)-based equating results. To find a better way to deal with the outlier common items with inconsistent b-parameters, the current study investigated the comparability of 10 variations of four IRT-based equating methods (i.e., concurrent…

Descriptors: Item Response Theory, Item Analysis, Computer Simulation, Equated Scores

The Theory about CD-CAT Based on FCA and Its Application

Peer reviewed

Shuqun, Yang; Shuliang, Ding; Zhiqiang, Yao – International Journal of Distance Education Technologies, 2009

Cognitive diagnosis (CD) plays an important role in intelligent tutoring systems. Computerized adaptive testing (CAT) is adaptive, fair, and efficient, which makes it suitable for large-scale examinations. Traditional cognitive diagnostic tests need a large number of items; efficient, tailored CAT could be a remedy for this, so the CAT with…

Descriptors: Monte Carlo Methods, Distance Education, Adaptive Testing, Intelligent Tutoring Systems

DIFAS: Differential Item Functioning Analysis System. Computer Program Exchange

Peer reviewed

Penfield, Randall D. – Applied Psychological Measurement, 2005

Differential item functioning (DIF) is an important consideration in assessing the validity of test scores (Camilli & Shepard, 1994). A variety of statistical procedures have been developed to assess DIF in tests of dichotomous (Hills, 1989; Millsap & Everson, 1993) and polytomous (Penfield & Lam, 2000; Potenza & Dorans, 1995) items. Some of these…

Descriptors: Test Bias, Item Analysis, Psychological Studies, Evaluation Methods

Multiple Evaluation: A New Testing Paradigm that Exorcizes Guessing

Peer reviewed

Dirkzwager, Arie – International Journal of Testing, 2003

The crux in psychometrics is how to estimate the probability that a respondent answers an item correctly on one occasion out of many. Under the current testing paradigm this probability is estimated using all kinds of statistical techniques and mathematical modeling. Multiple evaluation is a new testing paradigm using the person's own personal…

Descriptors: Psychometrics, Probability, Models, Measurement

Developing Formative Assessments for Postgraduate Students in Engineering

Peer reviewed

Burrow, Michael; Evdorides, Harry; Hallam, Barbara; Freer-Hewish, Richard – European Journal of Engineering Education, 2005

This paper outlines an approach taken to produce computer-based formative assessments for two modules in a one-year taught MSc programme in Road Management and Engineering. It presents the aims of the assessments, the taxonomy adopted to ensure that the formulation of the questions addressed learning outcomes related to the development of higher…

Descriptors: Evaluation Methods, Formative Evaluation, Psychometrics, Engineering Education
