Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 8 |
Author
Bardhoshi, Gerta | 1 |
Barkaoui, Khaled | 1 |
Conley, David T. | 1 |
Dolan, Conor V. | 1 |
Dudgeon, Paul | 1 |
Erford, Bradley T. | 1 |
Foster, Jeff L. | 1 |
Gardner, John | 1 |
Hahs-Vaughn, Debbie L. | 1 |
Hamilton, Laura S. | 1 |
Hau, Kit-Tai | 1 |
Publication Type
Reports - Descriptive | 14 |
Journal Articles | 12 |
Information Analyses | 1 |
Education Level
Elementary Secondary Education | 3 |
Higher Education | 2 |
Adult Education | 1 |
High Schools | 1 |
Audience
Practitioners | 1 |
Researchers | 1 |
Location
United Kingdom | 1 |
Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017
Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, α), test-retest, alternate forms, interscorer, and…
Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests
Gardner, John – Oxford Review of Education, 2013
Evidence from recent research suggests that in the UK the public perception of errors in national examinations is that they are simply mistakes; events that are preventable. This perception predominates over the more sophisticated technical view that errors arise from many sources and create an inevitable variability in assessment outcomes. The…
Descriptors: Educational Assessment, Public Opinion, Error of Measurement, Foreign Countries
Wu, Margaret – Educational Measurement: Issues and Practice, 2010
In large-scale assessments, such as state-wide testing programs, national sample-based assessments, and international comparative studies, there are many steps involved in the measurement and reporting of student achievement. There are always sources of inaccuracies in each of the steps. It is of interest to identify the source and magnitude of…
Descriptors: Testing Programs, Educational Assessment, Measures (Individuals), Program Effectiveness
Barkaoui, Khaled – Canadian Modern Language Review, 2007
Essay tests are widely used to assess ESL/EFL learners' writing abilities for instructional, administrative, and research purposes. Relevant literature was searched to identify 70 empirical studies on ESL/EFL essay tests. The majority of these studies examined task, essay, and rater effects on essay rating and scores. Less attention has been given…
Descriptors: Essay Tests, Language Tests, English (Second Language), Second Language Learning
McDonald, Roderick P. – Structural Equation Modeling, 2004
Improper structures arising from the estimation of parameters in structural equation models (SEMs) are commonly an indication that the model is incorrectly specified. The use of boundary solutions cannot in general be recommended. Partly on the basis of theory given by Van Driel, and partly by example, suggestions are made for using the data as…
Descriptors: Structural Equation Models, Evaluation Methods, Error of Measurement, Evaluation Research
Dolan, Conor V.; Wicherts, Jelte M.; Molenaar, Peter C. M. – Structural Equation Modeling, 2004
We consider the question of how variation in the number and reliability of indicators affects the power to reject the hypothesis that the regression coefficients are zero in latent linear regression analysis. We show that power remains constant as long as the coefficient of determination remains unchanged. Any increase in the number of indicators…
Descriptors: Error of Measurement, Factor Analysis, Regression (Statistics), Evaluation Methods
Hox, Joop; Lensvelt-Mulders, Gerty – Structural Equation Modeling, 2004
This article describes a technique to analyze randomized response data using available structural equation modeling (SEM) software. The randomized response technique was developed to obtain estimates that are more valid when studying sensitive topics. The basic feature of all randomized response methods is that the data are deliberately…
Descriptors: Structural Equation Models, Item Response Theory, Evaluation Research, Evaluation Methods
Hahs-Vaughn, Debbie L. – International Journal of Research & Method in Education, 2006
Oversampling and cluster sampling must be addressed when analyzing complex sample data. This study: (a) compares parameter estimates when applying weights versus not applying weights; (b) examines subset selection issues; (c) compares results when using standard statistical software (SPSS) versus specialized software (AM); and (d) offers…
Descriptors: Multivariate Analysis, Sampling, Data Analysis, Error of Measurement
Dudgeon, Paul – Structural Equation Modeling, 2004
This article considers the implications for other noncentrality parameter-based statistics from Steiger's (1998) multiple sample adjustment to the root mean square error of approximation (RMSEA) measure. When a structural equation model is fitted simultaneously in more than 1 sample, it is shown that the calculation of the noncentrality parameter…
Descriptors: Statistical Analysis, Monte Carlo Methods, Structural Equation Models, Error of Measurement
Conley, David T. – Educational Policy Improvement Center (NJ1), 2007
The AP Course Audit utilizes a criterion-based professional judgment method of analysis within a nested multi-step review process. The overall goal of the methodology is to yield a final judgment on each syllabus that is ultimately valid. While reviewer consistency is an important consideration, the most important goal is to reach a final judgment…
Descriptors: Academic Achievement, Compliance (Legal), Course Descriptions, Course Content
Marsh, Herbert W.; Hau, Kit-Tai; Wen, Zhonglin – Structural Equation Modeling, 2004
Goodness-of-fit (GOF) indexes provide "rules of thumb" (recommended cutoff values) for assessing fit in structural equation modeling. Hu and Bentler (1999) proposed a more rigorous approach to evaluating decision rules based on GOF indexes and, on this basis, proposed new and more stringent cutoff values for many indexes. This article discusses…
Descriptors: Statistical Significance, Structural Equation Models, Evaluation Methods, Evaluation Research
Saperstein, Aliya – Social Forces, 2006
Social constructivist theories of race suggest no two measures of race will capture the same information, but the degree of "error" this creates for quantitative research on inequality is unclear. Using unique data from the General Social Survey, I find observed and self-reported measures of race yield substantively different results when used to…
Descriptors: Race, Correlation, Income, Educational Attainment
Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008
With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…
Descriptors: Global Approach, Cultural Differences, Norms, Human Resources
McCaffrey, Daniel F.; Lockwood, J. R.; Koretz, Daniel M.; Hamilton, Laura S. – RAND Corporation, 2003
Value-added modeling (VAM) to estimate school and teacher effects is currently of considerable interest to researchers and policymakers. Recent reports suggest that VAM demonstrates the importance of teachers as a source of variance in student outcomes. Policymakers see VAM as a possible component of education reform through improved teacher…
Descriptors: Educational Change, Accountability, Inferences, Models