ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	10

Source

ProQuest LLC	3
ETS Research Report Series	2
Educational and Psychological…	2
Online Submission	2
American Educational Research…	1
Assessment	1
Evaluation and the Health…	1
International Educational…	1
Language Learning & Language…	1

Publication Type

Journal Articles	7
Reports - Research	7
Dissertations/Theses -…	3
Reports - Evaluative	3
Speeches/Meeting Papers	2
Collected Works - Proceedings	1
Information Analyses	1
Numerical/Quantitative Data	1
Tests/Questionnaires	1

Education Level

Higher Education	3
Elementary Education	2
Grade 8	2
Junior High Schools	2
Middle Schools	2
Secondary Education	2
Elementary Secondary Education	1
Grade 6	1
Intermediate Grades	1
Postsecondary Education	1

Audience

Location

Hong Kong	4
Australia	1
China	1
Finland	1
France	1
Iran	1
Italy	1
Japan	1
Kuwait	1
Norway	1
Singapore	1
Taiwan	1
Tunisia	1
United Kingdom (England)	1
United Kingdom (Scotland)	1
United States	1
More ▼

Laws, Policies, & Programs

Race to the Top

Assessments and Surveys

Graduate Record Examinations	1
Program for International…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing all 14 results Save | Export

A Log-Linear Modeling Approach for Differential Item Functioning Detection in Polytomously Scored Items

Peer reviewed

Direct link

Yesiltas, Gonca; Paek, Insu – Educational and Psychological Measurement, 2020

A log-linear model (LLM) is a well-known statistical method to examine the relationship among categorical variables. This study investigated the performance of LLM in detecting differential item functioning (DIF) for polytomously scored items via simulations where various sample sizes, ability mean differences (impact), and DIF types were…

Descriptors: Simulation, Sample Size, Item Analysis, Scores

Extreme Response Style: Which Model Is Best?

Direct link

Leventhal, Brian – ProQuest LLC, 2017

More robust and rigorous psychometric models, such as multidimensional Item Response Theory models, have been advocated for survey applications. However, item responses may be influenced by construct-irrelevant variance factors such as preferences for extreme response options. Through empirical and simulation methods, this study evaluates the use…

Descriptors: Psychometrics, Item Response Theory, Simulation, Models

Evidence of the Generalization and Construct Representation Inferences for the "GRE"® Revised General Test Sentence Equivalence Item Type. ETS GRE® Board Research Report. ETS GRE®-17-02. ETS Research Report. RR-17-05

Peer reviewed
PDF on ERIC

Download full text

Bejar, Isaac I.; Deane, Paul D.; Flor, Michael; Chen, Jing – ETS Research Report Series, 2017

The report is the first systematic evaluation of the sentence equivalence item type introduced by the "GRE"® revised General Test. We adopt a validity framework to guide our investigation based on Kane's approach to validation whereby a hierarchy of inferences that should be documented to support score meaning and interpretation is…

Descriptors: College Entrance Examinations, Graduate Study, Generalization, Inferences

Looking for Agreement among Criteria Used to Determine Teacher Effectiveness in Two Different Evaluation Models

Direct link

McGair, Charles D. – ProQuest LLC, 2012

Many theories, methods, and practices are utilized to evaluate teachers with the intention of determining teacher effectiveness to better inform decisions about retention, tenure, certification and performance-based pay. In the 21st century there has been a renewed emphasis on teacher evaluation in public schools, largely due to federal "Race…

Descriptors: Teacher Effectiveness, Models, Standards, Teacher Evaluation

The Internal/External Frame of Reference Model of Self-Concept and Achievement Relations: Age-Cohort and Cross-Cultural Differences

Peer reviewed

Direct link

Marsh, Herbert W.; Abduljabbar, Adel Salah; Parker, Philip D.; Morin, Alexandre J. S.; Abdelfattah, Faisal; Nagengast, Benjamin; Möller, Jens; Abu-Hilal, Maher M. – American Educational Research Journal, 2015

The internal/external frame of reference (I/E) model and dimensional comparison theory posit paradoxical relations between achievement (ACH) and self-concept (SC) in mathematics (M) and verbal (V) domains; ACH in each domain positively affects SC in the matching domain (e.g., MACH to MSC) but negatively in the nonmatching domain (e.g., MACH to…

Descriptors: Self Concept, Cultural Differences, Academic Achievement, Comparative Analysis

Complexity, Accuracy, Fluency and Lexis in Task-Based Performance: A Synthesis of the Ealing Research

Peer reviewed

Direct link

Skehan, Peter; Foster, Pauline – Language Learning & Language Teaching (MS), 2012

This chapter will present a research synthesis of a series of studies, termed here the Ealing research. The studies use the same general framework to conceptualise tasks and task performance, enabling easier comparability. The different studies, although each is self-contained, build into a wider picture of task performance. The major point of…

Descriptors: Language Fluency, Linguistic Performance, Task Analysis, Guidelines

Reliability Generalization: An Examination of the Positive Affect and Negative Affect Schedule

Peer reviewed

Direct link

Leue, Anja; Lange, Sebastian – Assessment, 2011

The assessment of positive affect (PA) and negative affect (NA) by means of the Positive Affect and Negative Affect Schedule has received a remarkable popularity in the social sciences. Using a meta-analytic tool--namely, reliability generalization (RG)--population reliability scores of both scales have been investigated on the basis of a random…

Descriptors: Social Sciences, True Scores, Generalization, Affective Behavior

Investigating the Suitability of Implementing the "e-rater"® Scoring Engine in a Large-Scale English Language Testing Program. Research Report. ETS RR-13-36

Peer reviewed
PDF on ERIC

Download full text

Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013

In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…

Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests

Justifying the Use of a Second Language Oral Test as an Exit Test in Hong Kong: An Application of Assessment Use Argument Framework

Direct link

Jia, Yujie – ProQuest LLC, 2013

This study employed Bachman and Palmer's (2010) Assessment Use Argument framework to investigate to what extent the use of a second language oral test as an exit test in a Hong Kong university can be justified. It also aimed to help test developers of this oral test identify the most critical areas in the current test design that might need…

Descriptors: Test Use, Language Tests, Oral Language, Second Language Learning

A Multivariate Generalizability Model for Clinical Skills Assessments

Peer reviewed

Direct link

Jarjoura, David; Early, Larry; Androulakakis, Voula – Educational and Psychological Measurement, 2004

Assessments of clinical skills of medical students rely increasingly on standardized patients demonstrating medical cases with faculty rating performance. The common finding of inconsistency of scores across cases is often referred to as case specificity. A multivariate generalizability model reveals that overall case specificity cannot explain…

Descriptors: Patients, Medical Students, Clinical Experience, Physician Patient Relationship

The Assessment of Professional Competence.

Peer reviewed

Kane, Michael T. – Evaluation and the Health Professions, 1992

A proposed model for the validity of measures of professional competence treats validation as the evaluation of inferences drawn from test scores, focusing on evaluation, generalization, and extrapolation. The model is used to indicate strengths and weaknesses of assessments of professional competence: observations of performance, simulations, and…

Descriptors: Competence, Evaluation Methods, Generalization, Inferences

Self-Concept and Mathematics Achievement: Modeling the Relationship under the Language Pressure in Hong Kong

Download full text

Wang, Jianjun – Online Submission, 2004

Located at a meeting place between the West and the East, Hong Kong has been chosen in this comparative investigation to reconfirm a theoretical model of "reciprocal relationship" between mathematics achievement and self-concept using the 8th grade databases from TIMSS and TIMSS-R. During the time between these two projects, Hong Kong…

Descriptors: Mathematics Achievement, Foreign Countries, Language of Instruction, Self Concept

An Empirical Study of Relationships between Student Self-Concept and Science Achievement in Hong Kong

Download full text

Wang, Jianjun; Oliver, Steve; Garcia, Augustine – Online Submission, 2004

Positive self-concept and good understanding of science are important indicators of scientific literacy endorsed by professional organizations. The existing research literature suggests that these two indicators are reciprocally related and mutually reinforcing. Generalization of the reciprocal model demands empirical studies in different…

Descriptors: Foreign Countries, Language of Instruction, Science Achievement, Scientific Literacy

Proceedings of the Seventh International Conference on Educational Data Mining (EDM) (7th, London, United Kingdom, July 4-7, 2014)

Download full text

Stamper, John, Ed.; Pardos, Zachary, Ed.; Mavrikis, Manolis, Ed.; McLaren, Bruce M., Ed. – International Educational Data Mining Society, 2014

The 7th International Conference on Education Data Mining held on July 4th-7th, 2014, at the Institute of Education, London, UK is the leading international forum for high-quality research that mines large data sets in order to answer educational research questions that shed light on the learning process. These data sets may come from the traces…

Descriptors: Information Retrieval, Data Processing, Data Analysis, Data Collection

Generalization	14
Models	14
Scores	14
Evaluation Methods	6
Foreign Countries	6
Statistical Analysis	6
Item Response Theory	5
Simulation	5
Comparative Analysis	4
Correlation	4
Test Items	4
English (Second Language)	3
Error of Measurement	3
Gender Differences	3
Item Analysis	3
Mathematics Achievement	3
Prediction	3
Regression (Statistics)	3
Sample Size	3
Second Language Learning	3
Self Concept	3
Academic Achievement	2
Affective Behavior	2
Computer Assisted Testing	2
Computer Software	2
More ▼

Wang, Jianjun	2
Abdelfattah, Faisal	1
Abduljabbar, Adel Salah	1
Abu-Hilal, Maher M.	1
Androulakakis, Voula	1
Bejar, Isaac I.	1
Breyer, F. Jay	1
Chen, Jing	1
Deane, Paul D.	1
Early, Larry	1
Flor, Michael	1
Foster, Pauline	1
Garcia, Augustine	1
Jarjoura, David	1
Jia, Yujie	1
Kane, Michael T.	1
Lange, Sebastian	1
Leue, Anja	1
Leventhal, Brian	1
Lorenz, Florian	1
Marsh, Herbert W.	1
Mavrikis, Manolis, Ed.	1
McGair, Charles D.	1
McLaren, Bruce M., Ed.	1
Morin, Alexandre J. S.	1
More ▼