Computer Science > Computation and Language

Title: ReALM: Reference Resolution As Language Modeling

Abstract: Reference resolution is an important problem, one that is essential to understand and successfully handle context of different kinds. This context includes both previous turns and context that pertains to non-conversational entities, such as entities on the user's screen or those running in the background. While LLMs have been shown to be extremely powerful for a variety of tasks, their use in reference resolution, particularly for non-conversational entities, remains underutilized. This paper demonstrates how LLMs can be used to create an extremely effective system to resolve references of various types, by showing how reference resolution can be converted into a language modeling problem, despite involving forms of entities like those on screen that are not traditionally conducive to being reduced to a text-only modality. We demonstrate large improvements over an existing system with similar functionality across different types of references, with our smallest model obtaining absolute gains of over 5% for on-screen references. We also benchmark against GPT-3.5 and GPT-4, with our smallest model achieving performance comparable to that of GPT-4, and our larger models substantially outperforming it.

Teachers are using AI to grade essays. But some experts are raising ethical concerns

Samantha Murphy Kelly

When Diane Gayeski, a professor of strategic communications at Ithaca College, receives an essay from one of her students, she runs part of it through ChatGPT, asking the AI tool to critique and suggest how to improve the work.

“The best way to look at AI for grading is as a teaching assistant or research assistant who might do a first pass … and it does a pretty good job at that,” she told CNN.

She shows her students the feedback from ChatGPT and how the tool rewrote their essay. “I’ll share what I think about their intro, too, and we’ll talk about it,” she said.

Gayeski requires her class of 15 students to do the same: run their draft through ChatGPT to see where they can make improvements.

The emergence of AI is reshaping education, presenting real benefits, such as automating some tasks to free up time for more personalized instruction, but also some big hazards, from issues around accuracy and plagiarism to maintaining integrity.

Both teachers and students are using the new technology. A report by strategy consulting firm Tyton Partners, sponsored by plagiarism detection platform Turnitin, found half of college students used AI tools in fall 2023. Although fewer faculty members used AI, their share grew to 22% in fall 2023, up from 9% in spring 2023.

Teachers are turning to AI tools and platforms, such as ChatGPT, Writable, Grammarly and EssayGrader, to assist with grading papers, writing feedback, developing lesson plans and creating assignments. They’re also using the burgeoning tools to create quizzes, polls, videos and interactives to “up the ante” for what’s expected in the classroom.

Students, on the other hand, are leaning on tools such as ChatGPT and Microsoft Copilot, which is built into Word, PowerPoint and other products.

But while some schools have formed policies on how students can or can’t use AI for schoolwork, many do not have guidelines for teachers. The practice of using AI for writing feedback or grading assignments also raises ethical considerations. And parents and students who are already spending hundreds of thousands of dollars on tuition may wonder if an endless feedback loop of AI-generated and AI-graded content in college is worth the time and money.

“If teachers use it solely to grade, and the students are using it solely to produce a final product, it’s not going to work,” said Gayeski.

The time and place for AI

How teachers use AI depends on many factors, particularly when it comes to grading, according to Dorothy Leidner, a professor of business ethics at the University of Virginia. If the material being tested in a large class is largely declarative knowledge — so there is a clear right and wrong — then a teacher grading using the AI “might be even superior to human grading,” she told CNN.

AI would allow teachers to grade papers faster and more consistently and avoid fatigue or boredom, she said.

But Leidner noted when it comes to smaller classes or assignments with less definitive answers, grading should remain personalized so teachers can provide more specific feedback and get to know a student’s work, and, therefore, progress over time.

“A teacher should be responsible for grading but can give some responsibility to the AI,” she said.

She suggested teachers use AI to look at certain metrics — such as structure, language use and grammar — and give a numerical score on those figures. But teachers should then grade students’ work themselves when looking for novelty, creativity and depth of insight.

Leslie Layne, who has been teaching ChatGPT best practices in her writing workshop at the University of Lynchburg in Virginia, said she sees the advantages for teachers but also sees drawbacks.

“Using feedback that is not truly from me seems like it is shortchanging that relationship a little,” she said.

She also sees uploading a student’s work to ChatGPT as a “huge ethical consideration” and potentially a breach of their intellectual property. AI tools like ChatGPT use such entries to train their algorithms on everything from patterns of speech to how to make sentences to facts and figures.

Ethics professor Leidner agreed, saying this should particularly be avoided for doctoral dissertations and master’s theses because the student might hope to publish the work.

“It would not be right to upload the material into the AI without making the students aware of this in advance,” she said. “And maybe students should need to provide consent.”

Some teachers are leaning on software called Writable that uses ChatGPT to help grade papers but is “tokenized,” so essays do not include any personal information, and it’s not shared directly with the system.

Teachers upload essays to the platform, which then provides suggested feedback for students. Writable was recently acquired by education company Houghton Mifflin Harcourt.

Other educators are using platforms such as Turnitin that boast plagiarism detection tools to help teachers identify when assignments are written by ChatGPT and other AI. But these types of detection tools are far from foolproof; OpenAI shut down its own AI-detection tool last year due to what the company called a “low rate of accuracy.”

Setting standards

Some schools are actively working on policies for both teachers and students. Alan Reid, a research associate in the Center for Research and Reform in Education (CRRE) at Johns Hopkins University, said he recently spent time working with K-12 educators who use GPT tools to create end-of-quarter personalized comments on report cards.

But like Layne, he acknowledged the technology’s ability to write insightful feedback remains “limited.”

He currently sits on a committee at his college that’s authoring an AI policy for faculty and staff; discussions are ongoing, not just for how teachers use AI in the classroom but how it’s used by educators in general.

He acknowledges schools are having conversations about using generative AI tools to create things like promotion and tenure files, performance reviews, and job postings.

Nicolas Frank, an associate professor of philosophy at University of Lynchburg, said universities and professors need to be on the same page when it comes to policies but need to stay cautious.

“There is a lot of danger in making policies about AI at this stage,” he said.

He worries it’s still too early to understand how AI will be integrated into everyday life. He is also concerned that some administrators who don’t teach in classrooms may craft policy that misses nuances of instruction.

“That may create a danger of oversimplifying the problems with AI use in grading and instruction,” he said. “Oversimplification is how bad policy is made.”

To start, he said educators can identify clear abuses of AI and begin policy-making around those.

Leidner, meanwhile, said universities can be very high level with their guidance, such as making transparency a priority — so students have a right to know when AI is being used to grade their work — and identifying what types of information should never be uploaded into an AI or asked of an AI.

But she said universities must also be open to “regularly reevaluating as the technology and uses evolve.”

Amanda Hoover

Students Are Likely Writing Millions of Papers With AI

Students have submitted more than 22 million papers that may have used generative AI in the past year, new data released by plagiarism detection company Turnitin shows.

A year ago, Turnitin rolled out an AI writing detection tool that was trained on its trove of papers written by students as well as other AI-generated texts. Since then, more than 200 million papers have been reviewed by the detector, predominantly written by high school and college students. Turnitin found that 11 percent may contain AI-written language in 20 percent of their content, with 3 percent of the total papers reviewed getting flagged for having 80 percent or more AI writing. (Turnitin is owned by Advance, which also owns Condé Nast, publisher of WIRED.) Turnitin says its detector has a false positive rate of less than 1 percent when analyzing full documents.

ChatGPT’s launch was met with knee-jerk fears that the English class essay would die. The chatbot can synthesize information and distill it near-instantly, but that doesn’t mean it always gets it right. Generative AI has been known to hallucinate, creating its own facts and citing academic references that don’t actually exist. Generative AI chatbots have also been caught spitting out biased text on gender and race. Despite those flaws, students have used chatbots for research, organizing ideas, and as a ghostwriter. Traces of chatbots have even been found in peer-reviewed, published academic writing.

Teachers understandably want to hold students accountable for using generative AI without permission or disclosure. But that requires a reliable way to prove AI was used in a given assignment. Instructors have tried at times to find their own solutions to detecting AI in writing, using messy, untested methods to enforce rules, and distressing students. Further complicating the issue, some teachers are even using generative AI in their grading processes.

Detecting the use of gen AI is tricky. It’s not as easy as flagging plagiarism, because generated text is still original text. Plus, there’s nuance to how students use gen AI; some may ask chatbots to write their papers for them in large chunks or in full, while others may use the tools as an aid or a brainstorm partner.

Students also aren't tempted by only ChatGPT and similar large language models. So-called word spinners are another type of AI software that rewrites text, and may make it less obvious to a teacher that work was plagiarized or generated by AI. Turnitin’s AI detector has also been updated to detect word spinners, says Annie Chechitelli, the company’s chief product officer. It can also flag work that was rewritten by services like spell checker Grammarly, which now has its own generative AI tool. As familiar software increasingly adds generative AI components, what students can and can’t use becomes more muddled.

Detection tools themselves have a risk of bias. English language learners may be more likely to set them off; a 2023 study found a 61.3 percent false positive rate when evaluating Test of English as a Foreign Language (TOEFL) exams with seven different AI detectors. The study did not examine Turnitin’s version. The company says it has trained its detector on writing from English language learners as well as native English speakers. A study published in October found that Turnitin was among the most accurate of 16 AI language detectors in a test that had the tool examine undergraduate papers and AI-generated papers.

Schools that use Turnitin had access to the AI detection software for a free pilot period, which ended at the start of this year. Chechitelli says a majority of the service’s clients have opted to purchase the AI detection. But the risks of false positives and bias against English learners have led some universities to ditch the tools for now. Montclair State University in New Jersey announced in November that it would pause use of Turnitin’s AI detector. Vanderbilt University and Northwestern University did the same last summer.

“This is hard. I understand why people want a tool,” says Emily Isaacs, executive director of the Office of Faculty Excellence at Montclair State. But Isaacs says the university is concerned about potentially biased results from AI detectors, as well as the fact that the tools can’t provide confirmation the way they can with plagiarism. Plus, Montclair State doesn’t want to put a blanket ban on AI, which will have some place in academia. With time and more trust in the tools, the policies could change. “It’s not a forever decision, it’s a now decision,” Isaacs says.

Chechitelli says the Turnitin tool shouldn’t be the only consideration in passing or failing a student. Instead, it’s a chance for teachers to start conversations with students that touch on all of the nuance in using generative AI. “People don’t really know where that line should be,” she says.

research paper of education pdf

IMAGES

  1. (PDF) PEDAGOGY IN HIGHER EDUCATION

  2. How to Write and Publish a Research Paper.pdf

  3. (PDF) How to Write An Effective Research Proposal For Higher Degree

  4. Scope Of Educational Technology B Ed Notes

  5. 38+ Research Paper Samples

  6. (PDF) A Research Paper on Social media: An Innovative Educational Tool

VIDEO

  1. 💯Introduction to Research Methods 💥M.ed Examination🔥First Semester Previous Year Paper👍

  2. Research guidelines and Article format II Private Batch II

  3. How to search Elsevier Interdisciplinary journal with IMPACT FACTOR and publish for free #elsevier

  4. How to chat with articles or research papers by SciSpace and do hours of research in minutes

  5. Impact of Educational Research: P David Pearson, PhD

  6. How to Write a Research Paper using ChatGPT & Bard AI

COMMENTS

  1. Research Papers in Education: Vol 39, No 2 (Current issue)

    A structured discussion of the fairness of GCSE and A level grades in England in summer 2020 and 2021. et al. Article | Published online: 18 Feb 2024. Explore the current issue of Research Papers in Education, Volume 39, Issue 2, 2024.

  2. (PDF) Educational Research: Educational Purposes, The Nature of

    Licensed Under Creative Commons Attribution CC BY. 1. Educational Research: Educational Purposes, The Nature of Knowledge and Ethical Issues. Julio López-Alvarado. Association for the Promotion ...

  3. PDF Students' Perceptions towards the Quality of Online Education: A

    Students' Perceptions towards the Quality of Online Education: A Qualitative Approach. Yi Yang, Linda F. Cornelius, Mississippi State University. Abstract. How to ensure the quality of online learning in institutions of higher education has been a growing concern during the past several years.

  4. PDF Understanding the Purpose of Higher Education: an Analysis of The

    The ultimate goal is to develop renovation or repurposing strategy across competing imperatives and to outline success measures to critically define, measure, and evaluate the achievement of specific goals and outcomes in hopes of resolving potential skills mismatch in a world of massive cataclysmic change.

  5. Systems Research in Education: Designs and methods

    This exploratory paper seeks to shed light on the methodological challenges of education systems research. There is growing consensus that interventions to improve learning outcomes must be designed and studied as part of a broader system of education, and that learning outcomes are affected by a complex web of dynamics involving different inputs, actors, processes and socio-political contexts.

  6. PDF The Concept of Quality in Education: a Review of The 'International

    The Department of Education, University of Bath, UK The Institute for Educational Planning and Administration, University of Cape Coast, Ghana The Faculty of Education, University of Dar es Salaam, Tanzania The Kigali Institute of Education, Rwanda The Education Policy Unit, University of the Witwatersrand, Johannesburg, South Africa.

  7. PDF ISSN (Online): 2398-3760 Educational Research: Educational Purposes

    education should be a means to give people more freedom, and to build a better society for all, to become more human. The idea that education would provide a better future and better jobs was challenged by Brown et al. (2011) in their seminal book, The Global Auction, where they discussed about the broken promise of education. ...

  8. (Pdf) Exploring Current Trends in Education: a Review of Research

    The complexity of special education and the variability among students Autism Spectrum Disorder (ASD) require special education teachers to make a concerted effort to provide validated supports ...

  9. (PDF) THE PURPOSE OF EDUCATION

    Higher education as part of the individual education process is a significant step in the formatting of lifelong learners (UNESCO, n.d.). Education fosters critical thinking skills, helps students ...

  10. PDF Education for Sustainability: Quality Education Is A Necessity in

    Quality in education is a multi-dimensional concept with different components (Sallis, 2002). According to some researchers the definitions of quality are: Quality is fulfilling & exceeding customer's needs, Quality is everyone's job and quality is continuous improvement. Quality is recognition and reward.

  11. PDF INTRODUCTION TO EDUCATIONAL RESEARCH

    We will progress to explore what is meant by both 'education' and 'research', through exploring the different contexts in which they occur. By this, we will explore contemporary issues and trends in educational research to highlight a range of research, from both an applied perspective and an academic perspective. This chapter

  12. PDF Education Technology: An Evidence-Based Review

    education landscape. Revolutionary advances in information and communications technology (ICT)—particularly disciplines associated with computers, mobile phones, and the Internet— have precipitated a renaissance in education technology (ed-tech), a term we use here to refer to any ICT application that aims to improve education.

  13. PDF Working Paper 27392 http://www.nber.org/papers/w27392

    education, choice of major, etc.). Our results underscore the fact that the COVID-19 shock is likely to exacerbate socioeconomic disparities in higher education. This is consistent with findings regarding the impacts of COVID-19 on K-12 students. Kuhfeld et al. (2020) project that school closures are likely to lead to significant learning losses in math

  14. PDF Philosophical Foundation of Education

    The philosophical foundation of education is a crucial aspect of the field of education. Philosophical inquiry has played a significant role in shaping educational theories, practices, and policies. This research paper aims to explore the philosophical foundation of education, its key concepts, and its significance in the field of

  15. ERIC

    ERIC is an online library of education research and information, sponsored by the Institute of Education Sciences (IES) of the U.S. Department of Education.

  16. PDF Journal of Indian Education

    102 Journal of Indian Education November 2019. 14210 Upper Primary schools in the year 2012-13, then the number decreased, and by the year 2016-17, the number of Upper Primary schools reached to 11884. Table 2 Management wise Distribution of Elementary Schools in 2016-17 Type Primary Upper Primary Total.

  17. PDF Technology and Education: Computers, Software, and the Internet

    Although technology is a broad term, the paper focuses on the effects of computers, the Internet, and software such as computer assisted instruction, which are currently the most relevant forms of new technology in education. The discussion focuses primarily on the impacts of computers, the Internet and software on educational outcomes ...

  18. (PDF) Educational Inequality

    arXiv:2204.04701v1 [econ.GN] 10 Apr 2022. Educational Inequality *. Jo Blanden Matthias Doepke Jan Stuhler. April 2022. Abstract. This chapter provides new evidence on educational inequality and ...

  19. PDF Poverty in Education

    Description of Poverty and the Role of Education. Poverty can best be described as a family of four or more whose average yearly income falls below the federal poverty level of $22,050. In order for families to make ends meet, research shows that approximately twice the income of the federal poverty level is needed.

  20. PDF New Jersey Department of Education's High-Impact Tutoring Resource Release

    The New Jersey Department of Education (NJDOE) is excited to announce the release of guidance titled, High- Impact Tutoring: An Evidence-Based Strategy to Accelerate Learning. This resource provides information on the benefits and essential design elements of effective, high-impact tutoring programs to support local education agencies' (LEAs ...

  21. [2403.20329] ReALM: Reference Resolution As Language Modeling

    ReALM: Reference Resolution As Language Modeling. Reference resolution is an important problem, one that is essential to understand and successfully handle context of different kinds. This context includes both previous turns and context that pertains to non-conversational entities, such as entities on the user's screen or those running in the ...

  22. PDF United States House Committee on Education and the

    Further, the leadership at the Department of Education seems tone-deaf. On March 15, 2024, when most schools had received at most a handful of records, Secretary Cardona wrote a letter to school presidents that in part implied that it was schools who were unprepared to receive ISIRs and prepare awards.

  23. Teachers are using AI to grade essays. Students are using AI to write

    Meanwhile, while fewer faculty members used AI, the percentage grew to 22% of faculty members in the fall of 2023, up from 9% in spring 2023. Teachers are turning to AI tools and platforms ...

  24. Students Are Likely Writing Millions of Papers With AI

    Since then, more than 200 million papers have been reviewed by the detector, predominantly written by high school and college students. Turnitin found that 11 percent may contain AI-written ...

  25. PDF DEPARTMENT OF EDUCATION Statement by Miguel Cardona Secretary of

    Education research and data are important because high-quality information about effective practices and trends in student achievement can help improve teaching and learning, student outcomes, and the return on the public investment in education at the Federal, State, and local levels.

  26. (PDF) The Importance of Education

    The Importance of Education. Education is an important issue in one's life. It is the key to success in the future, and to have many opportunities in our life. Education has many advantages ...

  27. National Grade 6 Mock Assessment # 2 Science paper 1

    National Grade 6 Mock Assessment # 2 Science paper 1. Popular. Published on 28 March 2024. Modified on 02 April 2024. 758 downloads. Download (pdf, 286 KB): Final Mock 2 2024 Science Paper 1.pdf.

  28. PDF New Jersey Department of Education

    This is a survey for parents of school-age students receiving special education services (kindergarten through high school). Your responses will help guide efforts to improve services and results for children and families. For each statement below, please select one of

  29. (PDF) Impact of modern technology in education

    Importance of technology in education. The role of technology in the field of education is four-fold: it is included as a part of the curriculum, as an instructional delivery system, as a ...