Showing 1 to 15 of 48 results
Peer reviewed
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Peer reviewed
Wirth, Astrid; Stadler, Matthias; Birtwistle, Efsun; Niklas, Frank – Journal of Educational Psychology, 2023
The Home Learning Environment (HLE) focuses on everyday learning habits in families to support the development of children's early cognitive competencies. A growing number of studies have assessed the HLE by using different conceptual approaches and various assessment methods, often focusing on either the home literacy environment or the home…
Descriptors: Home Study, Educational Environment, Family Environment, Outcomes of Education
Peer reviewed
Park, Siwon – Journal of Pan-Pacific Association of Applied Linguistics, 2017
This paper examines how different test methods may tap different aspects of second language knowledge. It employs multiple-choice (MC) and constructed response (CR) items which yield distinct or convergent information in the computer delivered testing of English in its presentation of this factor. In order to examine the effects of test method, a…
Descriptors: Evaluation Methods, Second Language Learning, English (Second Language), Computer Assisted Testing
Peer reviewed
Ngo, Federick; Kwon, William W. – Research in Higher Education, 2015
Community college students are often placed in developmental math courses based on the results of a single placement test. However, concerns about accurate placement have recently led states and colleges across the country to consider using other measures to inform placement decisions. While the relationships between college outcomes and such…
Descriptors: Access to Education, Success, Community Colleges, Mathematics Education
Peer reviewed
Rojahn, Johannes; Schroeder, Stephen R.; Mayo-Ortega, Liliana; Oyama-Ganiko, Rosao; LeBlanc, Judith; Marquis, Janet; Berke, Elizabeth – Research in Developmental Disabilities: A Multidisciplinary Journal, 2013
Reliable and valid assessment of aberrant behaviors is essential in empirically verifying prevention and intervention for individuals with intellectual or developmental disabilities (IDD). Few instruments exist which assess behavior problems in infants. The current longitudinal study examined the performance of three behavior-rating scales for…
Descriptors: Rating Scales, Behavior Problems, Developmental Disabilities, Infants
Peer reviewed
Rodgers, Joseph Lee; Rodgers, Jacci L. – Journal of Continuing Higher Education, 2011
We propose, develop, and evaluate the black ink-red ink (BIRI) method of testing. This approach uses two different methods within the same test administration setting, one that matches recognition learning and the other that matches recall learning. Students purposively define their own tradeoff between the two approaches. Evaluation of the method…
Descriptors: Testing, Test Anxiety, Recall (Psychology), Recognition (Psychology)
Peer reviewed
Quellmalz, Edys S.; Davenport, Jodi L.; Timms, Michael J.; DeBoer, George E.; Jordan, Kevin A.; Huang, Chun-Wei; Buckley, Barbara C. – Journal of Educational Psychology, 2013
How can assessments measure complex science learning? Although traditional, multiple-choice items can effectively measure declarative knowledge such as scientific facts or definitions, they are considered less well suited for providing evidence of science inquiry practices such as making observations or designing and conducting investigations.…
Descriptors: Science Education, Educational Assessment, Psychometrics, Science Tests
Peer reviewed
Rhemtulla, Mijke; Brosseau-Liard, Patricia E.; Savalei, Victoria – Psychological Methods, 2012
A simulation study compared the performance of robust normal theory maximum likelihood (ML) and robust categorical least squares (cat-LS) methodology for estimating confirmatory factor analysis models with ordinal variables. Data were generated from 2 models with 2-7 categories, 4 sample sizes, 2 latent distributions, and 5 patterns of category…
Descriptors: Factor Analysis, Computation, Simulation, Sample Size
Peer reviewed
White, C. Stephen; Fox, Rebecca K.; Isenberg, Joan P. – European Journal of Teacher Education, 2011
This study examined how multiple measures can be used to study experienced teachers' learning. The study was conducted in an advanced Master's degree programme, aligned with the National Board for Professional Teaching Standards in the United States. The programmatic features and key learning experiences found in the programme are described and…
Descriptors: National Standards, Faculty Development, Masters Degrees, Investigations
Peer reviewed
Heath, Barbara; Lakshmanan, Aruna; Perlmutter, Aaron; Davis, Lori – International Journal of Research & Method in Education, 2010
Instrument choice is a crucial part of evaluation of professional development programmes. The use of multiple evaluation methods helps in triangulation, and offers insight into the developmental sequence involved in the changes in teacher beliefs and practice. Most current instruments are self-contained and not designed for use in conjunction with…
Descriptors: Evaluation Needs, Literature Reviews, Evaluation Methods, Professional Development
Peer reviewed
Geiser, Christian; Eid, Michael; Nussbeck, Fridtjof W.; Courvoisier, Delphine S.; Cole, David A. – Developmental Psychology, 2010
The authors show how structural equation modeling can be applied to analyze change in longitudinal multitrait-multimethod (MTMM) studies. For this purpose, an extension of latent difference models (McArdle, 1988; Steyer, Eid, & Schwenkmezger, 1997) to multiple constructs and multiple methods is presented. The model allows investigators to separate…
Descriptors: Structural Equation Models, Multitrait Multimethod Techniques, Validity, Measurement
Peer reviewed
LaGrange, Beth; Cole, David A. – Structural Equation Modeling: A Multidisciplinary Journal, 2008
This article examines 4 approaches for explaining shared method variance, each applied to a longitudinal trait-state-occasion (TSO) model. Many approaches have been developed to account for shared method variance in multitrait-multimethod (MTMM) data. Some of these MTMM approaches (correlated method, orthogonal method, correlated method minus one,…
Descriptors: Structural Equation Models, Longitudinal Studies, Multitrait Multimethod Techniques, Correlation
Peer reviewed
Sadler, D. Royce – Assessment & Evaluation in Higher Education, 2009
When assessment tasks are set for students in universities and colleges, a common practice is to advise them of the criteria that will be used for grading their responses. Various schemes for using multiple criteria have been widely advocated in the literature. Each scheme is designed to offer clear benefits for students. Breaking down holistic…
Descriptors: Student Evaluation, Grading, Evaluation Criteria, Evaluation Problems
Peer reviewed
Doumen, Sarah; Verschueren, Karine; Buyse, Evelien; De Munter, Sofie; Max, Kristel; Moens, Loth – Infant and Child Development, 2009
Two studies extended psychometric research on the Student-Teacher Relationship Scale (STRS) with kindergarten and preschool children (N₁ = 60-71; N₂ = 35) and their teachers. These studies used a multi-method approach to replicate and extend previous findings concerning the convergent validity of the STRS…
Descriptors: Conflict, Validity, Preschool Children, Kindergarten
Peer reviewed
Shujuan, Wang; Meihua, Qian; Jianxin, Zhang – Journal of Psychoeducational Assessment, 2009
This article examines the psychometric structure of the Anxiety Control Questionnaire (ACQ) in Chinese adolescents. With the data collected from 212 senior high school students (94 females, 110 males, 8 unknown), seven models are tested using confirmatory factor analyses in the framework of the multitrait-multimethod strategy. Results indicate…
Descriptors: Multitrait Multimethod Techniques, Factor Structure, Adolescents, Measures (Individuals)