One Hundred Year Study on Artificial Intelligence (AI100)

Conclusions


The field of artificial intelligence has made remarkable progress in the past five years and is having real-world impact on people, institutions and culture. The ability of computer programs to perform sophisticated language- and image-processing tasks, core problems that have driven the field since its birth in the 1950s, has advanced significantly. Although the current state of AI technology is still far short of the field’s founding aspiration of recreating full human-like intelligence in machines, research and development teams are leveraging these advances and incorporating them into society-facing applications. For example, the use of AI techniques in healthcare is becoming a reality, and the brain sciences are both a beneficiary of and a contributor to AI advances. Old and new companies are investing money and attention to varying degrees to find ways to build on this progress and provide services that scale in unprecedented ways.

The field’s successes have led to an inflection point: It is now urgent to think seriously about the downsides and risks that the broad application of AI is revealing. The increasing capacity to automate decisions at scale is a double-edged sword; intentional deepfakes or simply unaccountable algorithms making mission-critical recommendations can result in people being misled, discriminated against, and even physically harmed. Algorithms trained on historical data are disposed to reinforce and even exacerbate existing biases and inequalities. Whereas AI research has traditionally been the purview of computer scientists and researchers studying cognitive processes, it has become clear that all areas of human inquiry, especially the social sciences, need to be included in a broader conversation about the future of the field. Minimizing the negative impacts on society and enhancing the positive requires more than one-shot technological solutions; keeping AI on track for positive outcomes relevant to society requires ongoing engagement and continual attention.

Looking ahead, a number of important steps need to be taken. Governments play a critical role in shaping the development and application of AI, and they have been rapidly adjusting to acknowledge the importance of the technology to science, economics, and the process of governing itself. But government institutions are still behind the curve, and sustained investment of time and resources will be needed to meet the challenges posed by rapidly evolving technology. In addition to regulating the most influential aspects of AI applications on society, governments need to look ahead to ensure the creation of informed communities. Incorporating understanding of AI concepts and implications into K-12 education is an example of a needed step to help prepare the next generation to live in and contribute to an equitable AI-infused world.

The AI research community itself has a critical role to play in this regard, learning how to share important trends and findings with the public in informative and actionable ways, free of hype and clear about the dangers and unintended consequences along with the opportunities and benefits. AI researchers should also recognize that complete autonomy is not the eventual goal for AI systems. Our strength as a species comes from our ability to work together and accomplish more than any of us could alone. AI needs to be incorporated into that community-wide system, with clear lines of communication between human and automated decision-makers. At the end of the day, the success of the field will be measured by how it has empowered all people, not by how efficiently machines devalue the very people we are trying to help.

Cite This Report

Michael L. Littman, Ifeoma Ajunwa, Guy Berger, Craig Boutilier, Morgan Currie, Finale Doshi-Velez, Gillian Hadfield, Michael C. Horowitz, Charles Isbell, Hiroaki Kitano, Karen Levy, Terah Lyons, Melanie Mitchell, Julie Shah, Steven Sloman, Shannon Vallor, and Toby Walsh. "Gathering Strength, Gathering Storms: The One Hundred Year Study on Artificial Intelligence (AI100) 2021 Study Panel Report." Stanford University, Stanford, CA, September 2021. Doc: http://ai100.stanford.edu/2021-report. Accessed: September 16, 2021.

Report Authors

AI100 Standing Committee and Study Panel  

© 2021 by Stanford University. Gathering Strength, Gathering Storms: The One Hundred Year Study on Artificial Intelligence (AI100) 2021 Study Panel Report is made available under a Creative Commons Attribution-NoDerivatives 4.0 License (International):  https://creativecommons.org/licenses/by-nd/4.0/ .


FIU dissertations


Non-FIU dissertations

Many universities provide full-text access to their dissertations via a digital repository. If you know the title of a particular dissertation or thesis, try doing a Google search.

Aims to be the best possible resource for finding open access graduate theses and dissertations published around the world, with metadata from over 800 colleges, universities, and research institutions. It currently indexes over one million theses and dissertations.

This is a discovery service for open access research theses awarded by European universities.

A union catalog of Canadian theses and dissertations, in both electronic and analog formats, is available through the search interface on this portal.

There are currently more than 90 countries and over 1200 institutions represented. CRL has catalog records for over 800,000 foreign doctoral dissertations.

An international collaborative resource, the NDLTD Union Catalog contains more than one million records of electronic theses and dissertations. Use BASE, the VTLS Visualizer or any of the geographically specific search engines noted lower on their webpage.

Indexes doctoral dissertations and master's theses in all areas of academic research, with international coverage.

ProQuest Dissertations & Theses Global


Artificial Intelligence in Education (AIEd): a high-level academic and industry note 2021

  • Original Research
  • Open access
  • Published: 07 July 2021
  • Volume 2, pages 157–165 (2022)


Muhammad Ali Chaudhry (ORCID: orcid.org/0000-0003-0154-2613) and Emre Kazim


In the past few decades, technology has completely transformed the world around us. Indeed, experts believe that the next big digital transformation in how we live, communicate, work, trade and learn will be driven by Artificial Intelligence (AI) [ 83 ]. This paper presents a high-level industrial and academic overview of AI in Education (AIEd). It presents the focus of the latest AIEd research: reducing teachers' workload, contextualizing learning for students, revolutionizing assessments, and developing intelligent tutoring systems. It also discusses the ethical dimension of AIEd and the potential impact of the Covid-19 pandemic on the future of AIEd research and practice. The intended readership of this article is policy makers and institutional leaders looking for an introductory overview of the state of play in AIEd.


1 Introduction

Artificial Intelligence (AI) is changing the world around us [ 42 ]. As a term it is difficult to define, even for experts, because of its interdisciplinary nature and evolving capabilities. In the context of this paper, we define AI as a computer system that can achieve a particular task through certain capabilities (like speech or vision) and intelligent behaviour once considered unique to humans [ 54 ]. In more lay terms, we use the term AI to refer to intelligent systems that can automate tasks traditionally carried out by humans. Indeed, we read AI as a continuation of the digital age, with increased digital transformation changing the ways in which we live in the world. With such change, the skills and know-how of people must reflect the new reality, and in this context the World Economic Forum identified sixteen skills, referred to as twenty-first century skills, necessary for the future workforce [ 79 ]. These include skills such as technology literacy, communication, leadership, curiosity and adaptability. These skills have always been important for a successful career; however, with the accelerated digital transformation of the past two years and the focus on continuous learning in most professional careers, they are becoming necessary for all learners.

AI will play a very important role in how we teach and learn these new skills. In one dimension, AIEd has the potential to automate much of the tracking of a learner's progress across these skills and to identify where a human teacher's assistance is needed most. For teachers, AIEd can potentially help identify the most effective teaching methods based on students' contexts and learning backgrounds. It can automate monotonous operational tasks, generate assessments, and automate grading and feedback. AI shapes not only what students learn through recommendations, but also how they learn, what their learning gaps are, which pedagogies are more effective, and how to retain a learner's attention. In these cases, teachers are the 'human-in-the-loop': the role of AI is only to enable more informed decision-making by teachers, by providing them with predictions about students' performance or by recommending relevant content to students after teachers' approval. Here, the final decision-makers are teachers.
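
To make this 'human-in-the-loop' arrangement concrete, the sketch below shows one way a recommendation flow could keep teachers as the final decision-makers: the model only proposes, and nothing reaches a student without approval. This is a minimal illustration; the names (Recommendation, propose, teacher_review) and the 0.8 approval threshold are our assumptions, not taken from any system discussed here.

```python
from dataclasses import dataclass

@dataclass
class Recommendation:
    student_id: str
    resource_id: str
    score: float          # the model's estimated relevance, 0..1
    approved: bool = False

def propose(student_id, resource_scores, top_k=3):
    """Rank candidate resources for a student; nothing reaches the student yet."""
    ranked = sorted(resource_scores.items(), key=lambda kv: kv[1], reverse=True)
    return [Recommendation(student_id, rid, s) for rid, s in ranked[:top_k]]

def teacher_review(recs, approve_fn):
    """The teacher (approve_fn) is the final decision-maker."""
    for r in recs:
        r.approved = approve_fn(r)
    return [r for r in recs if r.approved]

# Example: the teacher approves only the model's most confident suggestion.
recs = propose("s42", {"fractions-video": 0.91, "quiz-3": 0.55, "essay-1": 0.20})
delivered = teacher_review(recs, approve_fn=lambda r: r.score >= 0.8)
print([r.resource_id for r in delivered])   # -> ['fractions-video']
```

In practice, approve_fn would stand in for an actual review step in a teacher's dashboard rather than a fixed score cutoff.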

Segal et al. [ 58 ] developed a system named SAGLET that utilized a 'human-in-the-loop' approach to visualize and model students' activities for teachers in real time, enabling them to intervene more effectively as and when needed. Here the role of AI is to empower teachers, enabling them to enhance students' learning outcomes. Similarly, Rodriguez et al. [ 52 ] have shown how teachers as the 'human-in-the-loop' can customize multimodal learning analytics and make them more effective in blended learning environments.

Critically, all these achievements are completely dependent on the quality of available learner data, which had been a long-standing challenge for ed-tech companies, at least until the pandemic. Use of technology in educational institutions around the globe is increasing [ 77 ]; however, educational technology (ed-tech) companies building AI-powered products have long complained about the lack of relevant data for training algorithms. The emergence of Covid-19 in late 2019 and its spread around the world pushed educational institutions online and left them at the mercy of ed-tech products to organize content, manage operations, and communicate with students. This shift has started generating huge amounts of data for ed-tech companies on which they can build AI systems. According to 'Shock to the System', a joint report published by Educate Ventures and Cambridge University, ed-tech companies' optimism about their own future increased during the pandemic, and their most pressing concern became recruiting more customers than they could serve effectively [ 15 ].

Additionally, most of the products and solutions provided by ed-tech start-ups lack the quality and resilience to cope with intensive use by many thousands of users. Product maturity has not kept pace with this huge and intense demand, as discussed in Sect. “ Latest research ” below. We also discuss some of these products in detail in Sect. “ Industry’s focus ” below. How do we mitigate the risks of these AI-powered products, and who monitors the risk? (We return to this theme in our discussion of ethics, Sect. “ Ethical AIEd ”.)

This paper is a non-exhaustive overview of AI in Education, presenting a brief survey of the latest developments in the field. It begins by discussing different aspects of education and learning where AI is being utilized, then turns to the industry’s current focus, and closes with a note on ethical concerns regarding AI in Education. The paper also briefly evaluates the potential impact of the pandemic on AI’s application in education. The intended readership is the policy community and institutional executives seeking an instructive introduction to the state of play in AIEd.

2 Latest research

Most work within AIEd can be divided into four main subdomains. In this section, we survey some of the latest work in each of these domains as case studies:

Reducing teachers’ workload: the purpose of AI in Education is to reduce teachers’ workload without impacting learning outcomes.

Contextualized learning for students: as every learner has unique learning needs, the purpose of AI in Education is to provide customized and/or personalised learning experiences to students based on their contexts and learning backgrounds.

Revolutionizing assessments: the purpose of AI in Education is to enhance our understanding of learners. This not only includes what they know, but also how they learn and which pedagogies work for them.

Intelligent tutoring systems (ITS): the purpose of AI in Education is to provide intelligent learning environments that can interact with students, provide customized feedback and enhance their understanding of certain topics.

2.1 Reducing teachers’ workload

Recent research in AIEd focuses more on teachers than on other stakeholders of educational institutions, and for good reason. Teachers are at the epicenter of every learning environment, face-to-face or virtual. Participatory design methodologies ensure that teachers are an integral part of the design of new AIEd tools, along with parents and learners [ 45 ]. Reducing teachers’ workload has been a long-standing challenge for educationists, who hope to achieve more effective teaching in classrooms by empowering teachers and having them focus more on teaching than on the surrounding activities.

With the focus on online education during the pandemic and the emergence of new tools to facilitate online learning, there is a growing need for teachers to adapt to these changes. Importantly, teachers themselves are having to re-skill and up-skill for this age, acquiring the new skills needed to fully utilize the benefits of AIEd [ 39 ]. First, they need to become tech-savvy enough to understand, evaluate and adopt new ed-tech tools as they become available. They may not necessarily use all these tools, but it is important to understand what the tools offer and whether they share teachers’ workload. For example, Zoom video calling has been widely used during the pandemic to deliver lessons remotely. Teachers need to know not only how to schedule lessons on Zoom, but also how to utilize functionalities like breakout rooms for group work and the whiteboard for freestyle writing. Second, teachers will need to develop analytical skills to interpret the data visualized by these ed-tech tools and to identify what kind of data and analytics they need to develop a better understanding of learners. This will enable teachers to get exactly what they need from ed-tech companies and ease their workload. Third, teachers will need to develop new teamwork, group and management skills to accommodate new tools in their daily routines, as they will be responsible for managing these new resources efficiently.

Selwood and Pilkington [ 61 ] showed that the use of Information and Communication Technologies (ICT) leads to a reduction in teachers’ workload if teachers use it frequently, receive proper training on how to use ICT, and have access to ICT at home and at school. During the pandemic, teachers have been left with no option other than online teaching. van der Spoel et al. [ 76 ] have shown that previous experience with ICT did not play a significant role in how teachers dealt with the online transition during the pandemic, suggesting that the new technologies are not a burden for teachers. It is too early to draw conclusions on the long-term effects of the pandemic on education, online learning and teachers’ workload; the use of ICT during the pandemic may not necessarily reduce teacher workload, but rather change its dynamics.

2.2 Contextualized learning for students

Every learner has a unique learning context based on their prior knowledge of the topic, social background, economic well-being and emotional state [ 41 ]. Teaching is most effective when tailored to these changing contexts. AIEd can help identify the learning gaps of each learner, offer content recommendations based on them, and provide step-by-step solutions to complex problems. For example, iTalk2Learn is an open-source platform developed by researchers to support math learning among students between 5 and 11 years of age [ 22 ]. This tutor interacted with students through speech, identified when students were struggling with fractions and intervened accordingly. Similarly, Pearson has launched a calculus learning tool called Aida that provides step-by-step guidance to students and helps them complete calculus tasks. Use of such tools by young students also raises interesting questions about the illusion of empathy that learners may develop towards such educational bots [ 73 ].

Open Learner Models [ 12 , 18 ] have been widely used in AIEd to help learners, teachers and parents understand what learners know, how they learn and how AI is being used to enhance learning. Another important construct in understanding learners is self-regulated learning [ 10 , 68 ]. Zimmerman and Schunk [ 85 ] define self-regulated learning as the learner’s thoughts, feelings and actions directed towards achieving a certain goal. Better understanding of learners through open learner models and self-regulated learning is the first step towards contextualized learning in AIEd. We do not yet have completely autonomous digital tutors for education in the mould of Amazon’s Alexa or Apple’s Siri, but domain-specific Intelligent Tutoring Systems (ITS) are very helpful in identifying how much students know, where they need help and what type of pedagogies would work for them.

There are a number of ed-tech tools available to develop basic skills in learners, like double-digit division or English grammar. In the future, AIEd-powered tools will move beyond these basics to develop twenty-first century skills like curiosity [ 49 ], initiative and creativity [ 51 ], and collaboration and adaptability [ 36 ].

2.3 Revolutionizing assessments

Assessment in an educational context refers to ‘any appraisal (or judgement or evaluation) of a student’s work or performance’ [ 56 ]. Hill and Barber [ 27 ] have identified assessment as one of the three pillars of schooling, along with curriculum and learning and teaching. The purpose of modern assessments is to evaluate what students know, understand and can do. Ideally, assessments should take account of the full range of student abilities and provide useful information about learning outcomes. However, every learner is unique, and so are their learning paths. How a standardized assessment can be used to evaluate every student, each with distinct capabilities, passions and expertise, is a question that can be posed to broader notions of educational assessment. According to Luckin [ 37 ] from University College London, ‘AI would provide a fairer, richer assessment system that would evaluate students across a longer period of time and from an evidence-based, value-added perspective’.

AIAssess is an example of an intelligent assessment tool, developed by researchers at the UCL Knowledge Lab [ 38 , 43 ]. It assessed students learning math and science based on three models: a knowledge model, an analytics model and a student model. The knowledge component stored the knowledge about each topic, the analytics component analyzed students’ interactions, and the student model tracked students’ progress on a particular topic. Similarly, Samarakou et al. [ 57 ] have developed an AI assessment tool that also performs qualitative evaluation of students, reducing the workload of instructors who would otherwise spend hours evaluating every exercise. Such tools can be further empowered by machine learning techniques such as semantic analysis, voice recognition, natural language processing and reinforcement learning to improve the quality of assessments.
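
To make the three-model decomposition concrete, here is a minimal sketch of how a knowledge model, an analytics model and a student model could be wired together. The class names, the simple evidence rule and the moving-average mastery update are illustrative assumptions, not AIAssess’s actual design.

```python
class KnowledgeModel:
    """Knowledge component: what there is to know about each topic."""
    def __init__(self):
        self.skills = {"fractions": ["equivalence", "addition"],
                       "forces": ["friction", "gravity"]}

class AnalyticsModel:
    """Analytics component: interprets raw interactions as evidence of mastery."""
    def evidence(self, interaction):
        # Crude rule: full credit for a correct answer, none otherwise.
        return 1.0 if interaction["correct"] else 0.0

class StudentModel:
    """Student component: tracks each student's estimated mastery per skill."""
    def __init__(self):
        self.mastery = {}   # (student, skill) -> running estimate in [0, 1]

    def update(self, student, skill, evidence, lr=0.3):
        old = self.mastery.get((student, skill), 0.5)
        self.mastery[(student, skill)] = old + lr * (evidence - old)

# Wiring the three components together for a single observed interaction:
km, am, sm = KnowledgeModel(), AnalyticsModel(), StudentModel()
obs = {"student": "s1", "skill": "equivalence", "correct": True}
assert obs["skill"] in km.skills["fractions"]          # a known skill
sm.update(obs["student"], obs["skill"], am.evidence(obs))
print(sm.mastery)   # -> {('s1', 'equivalence'): 0.65}
```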

2.4 Intelligent tutoring systems (ITS)

An intelligent tutoring system is a computer program that tries to mimic a human teacher to provide personalized learning to students [ 46 , 55 ]. The concept of ITS in AIEd is decades old [ 9 ], and there have always been huge expectations of what ITS could do to support learning. Over the years, we have observed a significant contrast between what ITS were envisioned to deliver and what they have actually been capable of doing [ 4 ].

A unique combination of domain models [ 78 ], pedagogical models [ 44 ] and learner models [ 20 ] was expected to provide contextualized learning experiences to students with customized content, like expert human teachers [ 26 , 59 , 65 ]. Later, more models were introduced to enhance students’ learning experience, such as the strategy model, knowledge-base model and communication model [ 7 ]. It was expected that an intelligent tutoring system would not just teach, but also ensure that students had learned; it would care for students [ 17 ]. Similar to human teachers, ITS would improve with time: they would learn from their experiences, ‘understand’ what works in which contexts, and then help students accordingly [ 8 , 60 ].
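
A toy example may help picture how these models interact. The sketch below implements a naive pedagogical policy over a domain model (skill prerequisites) and a learner model (per-skill mastery estimates): teach the least-mastered skill whose prerequisites are sufficiently mastered. The prerequisite encoding and the 0.7 threshold are assumptions made for illustration; real pedagogical models are far richer [ 44 ].

```python
def pedagogical_policy(learner_mastery, domain_prereqs, threshold=0.7):
    """Pick the next skill to teach: the least-mastered skill whose
    prerequisites are all mastered beyond the threshold."""
    ready = [skill for skill, prereqs in domain_prereqs.items()
             if all(learner_mastery.get(p, 0.0) >= threshold for p in prereqs)]
    if not ready:
        return None
    return min(ready, key=lambda s: learner_mastery.get(s, 0.0))

# Domain model: skill -> prerequisite skills. Learner model: mastery estimates.
domain = {"counting": [], "addition": ["counting"], "fractions": ["addition"]}
learner = {"counting": 0.9, "addition": 0.4}
print(pedagogical_policy(learner, domain))   # -> 'addition'
```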

In recent years, ITS have mostly been subject- and topic-specific, like ASSISTments [ 25 ], iTalk2Learn [ 23 ] and Aida Calculus. Despite being limited to the domain that a particular intelligent tutoring system addresses, they have proven effective in providing relevant content to students, interacting with students [ 6 ] and improving students’ academic performance [ 18 , 41 ]. However, ITS do not necessarily work in every context or facilitate every teacher [ 7 , 13 , 46 , 48 ]. Utterberg et al. [78] showed why teachers have abandoned the technology in some instances because it was counterproductive. They conducted a formative intervention with sixteen secondary school mathematics teachers and found systemic contradictions between teachers’ opinions and ITS recommendations, eventually leading to the abandonment of the tool. This highlights the importance of giving teachers the right to refuse AI-powered ed-tech if they are not comfortable with it.

Considering the direct correlation between emotions and learning [ 40 ], ITS have recently also started focusing on the emotional state of students while learning, to offer a more contextualized learning experience [ 24 ].

2.5 Popular conferences

To reflect the increasing interest and activity in the space of AIEd, some of the most popular conferences in AIEd are shown in Table 1 below. Due to the pandemic, all these conferences will also be held virtually in 2021. The first international workshop on multimodal artificial intelligence in education is being organized at the AIED conference [74] to promote the importance of multimodal data in AIEd.

3 Industry’s focus

In this section, we introduce the industry’s focus in the area of AIEd by case-studying companies at three levels: start-ups, established/large companies and mega-players (Amazon, Cisco). These companies represent different levels of the ecosystem (in terms of size).

3.1 Start-ups

A number of ed-tech companies are leading the AIEd revolution. New funds are emerging to invest in ed-tech companies and to help ed-tech start-ups scale their products, and there has been an increase in investor interest [ 21 ]. In 2020, the amount of investment raised by ed-tech companies more than doubled compared to 2019 (according to TechCrunch). This shows another dimension of the pandemic’s effect on ed-tech. With the increase in data coming in during the pandemic, it is expected that the industry’s focus on AI-powered products will grow.

EDUCATE, a leading accelerator focused on ed-tech companies, supported by the UCL Institute of Education and the European Regional Development Fund, was formed to bring research and evidence to the centre of product development for ed-tech. This accelerator has supported more than 250 ed-tech companies and 400 entrepreneurs and helped them focus on evidence-informed product development for education.

A number of ed-tech companies are emerging in this space with interesting business models. Third Space Learning offers maths intervention programs for primary and secondary school students. The company aims to provide low-cost quality tuition to support pupils from disadvantaged backgrounds in UK state schools. They have already offered 800,000 hours of teaching to around 70,000 students, 50% of whom were eligible for free meals. Mobile apps like Kaizen Languages, Duolingo and Babbel have also emerged to help individuals learn other languages.

3.2 Established players

Pearson is one of the leading educational companies in the world, with operations in more than 70 countries and more than 22,000 employees worldwide. They have been making a transition to digital learning and currently generate 66% of their annual revenue from it. According to Pearson, they have built the world’s first AI-powered calculus tutor, called Aida, which is publicly available on the App Store. But its effectiveness in improving students’ calculus skills without any human intervention remains to be seen.

Byju’s, an India-based ed-tech company known for creating engaging educational content for students, raised investment at a ten-billion-dollar valuation last year [ 70 ]. Century Tech is another ed-tech company empowering learning through AI. They claim to use neuroscience, learning science and AI to personalize learning and identify unique learning pathways for students in 25 countries, making more than sixty thousand AI-powered smart recommendations to learners every day.

Companies like Pearson and Century Tech are building impressive technology that reaches learners across the globe. But the usefulness of their acclaimed AI in helping learners from diverse backgrounds, with unique learning needs and completely different contexts, is yet to be proven. As discussed above, teachers play a very important role in how this AI is used by learners, so teacher training is vital to fully understand the strengths and weaknesses of these products. It is very important to be aware of where these AI products cannot help or can go wrong, so that teachers and learners know when to avoid relying on them.

In the past few years, the popularity of Massive Open Online Courses (MOOCs) has grown exponentially with the emergence of platforms like Coursera, Udemy, Udacity, LinkedIn Learning and edX [ 5 , 16 , 28 ]. AI can be utilized to develop a better understanding of learner behaviour on MOOCs, produce better content and enhance learning outcomes at scale. Considering that these platforms are collecting huge amounts of data, it will be interesting to see the future applications of AI in offering personalized and lifelong learning solutions to their users [ 81 ].

3.3 Mega-players

Seeing the business potential of AIEd and the kind of impact it can have on the future of humanity, some of the biggest tech companies around the globe are moving into this space. The shift to online education during the pandemic boosted the demand for cloud services. Amazon Web Services (AWS), a leader in cloud services, facilitated institutions like the Instituto Colombiano para la Evaluacion de la Educacion (ICFES) in scaling their online examination service for 70,000 students. Similarly, LSE utilized AWS to scale their online assessments for 2,000 students [ 1 , 3 ].

Google’s CEO Sundar Pichai stated that the pandemic offered an incredible opportunity to re-imagine education. Google launched more than 50 new software tools during the pandemic to facilitate remote learning. Google Classroom, part of Google Apps for Education (GAFE), is widely used by schools around the globe to deliver education, and research shows that it improves class dynamics and helps with learner participation [ 2 , 29 , 62 , 63 , 69 ].

Before moving on to the ethical dimensions of AIEd, it is important to conclude this section by noting an area of critical importance to both industry and services: the development of the AIEd infrastructure. Aside from the three levels of operation discussed above (start-up, medium and mega companies), as Luckin [41] points out, “True progress will require the development of an AIEd infrastructure. This will not, however, be a single monolithic AIEd system. Instead, it will resemble the marketplace that has been developed for smartphone apps: hundreds and then thousands of individual AIEd components, developed in collaboration with educators, conformed to uniform international data standards, and shared with researchers and developers worldwide. These standards will enable system-level data collation and analysis that help us learn much more about learning itself and how to improve it”.

4 Ethical AIEd

With a number of mishaps in the real world [ 31 , 80 ], ethics in AI has become a real concern for AI researchers and practitioners alike. Within computer science, there is a growing overlap with the broader field of Digital Ethics [ 19 ] and with the ethics and engineering focused on developing Trustworthy AI [ 11 ]. There is a focus on fairness, accountability, transparency and explainability [ 33 , 82 , 83 , 84 ]. Ethics in AI needs to be embedded in the entire development pipeline, from the decision to start collecting data to the point when the machine learning model is deployed in production. From an engineering perspective, Koshiyama et al. [ 35 ] have identified four verticals of algorithmic auditing: performance and robustness, bias and discrimination, interpretability and explainability, and algorithmic privacy.
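
As a rough illustration of how these four verticals might surface in practice, the sketch below arranges them as a simple audit report and computes one concrete bias metric, a demographic parity gap, by hand. The metric choice, field names and numbers are illustrative assumptions; Koshiyama et al. [ 35 ] treat the verticals at a much more general level.

```python
def demographic_parity_gap(preds, groups):
    """Largest difference in positive-prediction rate across groups."""
    rates = {}
    for p, g in zip(preds, groups):
        rates.setdefault(g, []).append(p)
    means = [sum(v) / len(v) for v in rates.values()]
    return max(means) - min(means)

# One entry per vertical; a real audit would hold many metrics in each.
audit_report = {
    "performance_and_robustness": {"holdout_accuracy": 0.87},
    "bias_and_discrimination": {
        "demographic_parity_gap": demographic_parity_gap(
            preds=[1, 0, 1, 1, 0, 0], groups=["a", "a", "a", "b", "b", "b"]),
    },
    "interpretability_and_explainability": {"feature_attributions_logged": True},
    "algorithmic_privacy": {"personal_data_minimized": True},
}
print(audit_report["bias_and_discrimination"])   # gap = 1/3 in this toy data
```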

In education, ethical AI is crucial to ensure the wellbeing of learners, teachers and the other stakeholders involved. There is a great deal of ongoing work on AIEd and AI-powered ed-tech tools, and with the influx of large amounts of data due to online learning during the pandemic, we will most likely see an increasing number of AI-powered ed-tech products. But ethics in AIEd is not a priority for most ed-tech companies and schools. One reason for this is the lack of awareness among relevant stakeholders of where AI can go wrong in the context of education. This means that drawbacks of using AI, such as discrimination against certain groups due to data deficiencies, stigmatization due to reliance on flawed machine learning modelling, and exploitation of personal data due to lack of awareness, can go unnoticed without any accountability.

An AI tool wrongly predicting that a particular student will not perform well in end-of-year exams, or might drop out next year, can play a very important role in shaping that student’s reputation in the eyes of teachers and parents. That reputation will determine how those teachers and parents treat the learner, resulting in a huge psychological impact, all based on a wrong description by an AI tool. One high-profile case of harm was the use of an algorithm to predict university entry results for students unable to take exams due to the pandemic; the system was shown to be biased against students from poorer backgrounds. As in other sectors where AI is making a huge impact, this raises an important ethical question in AIEd regarding giving students the freedom to opt out of AI-powered predictions and automated evaluations.

The ethical implications of AI in education are dependent on the kind of disruption AI is doing in the ed-tech sector. On the one hand, this can be at an individual level for example by recommending wrong learning materials to students, or it can collectively impact relationships between different stakeholders such as how teachers perceive learners’ progress. This can also lead to automation bias and issues of accountability [ 67 ] where teachers begin to blindly rely on AI tools and prefer the tool’s outcomes over their own better judgement, whenever there is a conflict.

Initiatives have been observed in this space. For example, Professor Rose Luckin, professor of learner-centred design at University College London, along with Sir Anthony Seldon, vice-chancellor of the University of Buckingham, and Priya Lakhani, founder and CEO of Century Tech, founded the Institute for Ethical AI in Education (IEAIEd) [ 72 ] to create awareness and promote the ethical aspects of AI in education. In its interim report, the institute identified seven requirements for ethical AI to mitigate risks for learners: human agency and oversight, to double-check AI’s performance; technical robustness and safety, to prevent AI going wrong with new data or being hacked; diversity, to ensure a similar distribution of different demographics in data and avoid bias; non-discrimination and fairness, to prevent anyone from being unfairly treated by AI; privacy and data governance, to ensure everyone has the right to control their data; transparency, to enhance the understanding of AI products; societal and environmental well-being, to ensure that AI is not causing any harm; and accountability, to ensure that someone takes responsibility for any wrongdoings of AI. Recently, the institute has also published a framework [ 71 ] for educators, schools and ed-tech companies to help them select ed-tech products with various ethical considerations in mind, like ethical design, transparency and privacy.

With the focus on online learning during the pandemic and greater utilization of AI-powered ed-tech tools, the risks of AI going wrong have increased significantly for all stakeholders, including ed-tech companies, schools, teachers and learners. A lot more work needs to be done on ethical AI in learning contexts to mitigate these risks, including assessments that balance risks against opportunities.

UNESCO published the ‘Beijing Consensus’ on AI and Education, which recommended that member states take a number of actions for the smooth and positively impactful integration of AI with education [ 74 ]. International bodies like the EU have also recently published a set of draft guidelines, under the heading of the EU AI Act, to ban certain uses of AI and categorize others as ‘high risk’ [ 47 ].

5 Future work

With the focus on online education due to Covid-19 over the past year, it will be consequential to see what AI has to offer for education, with vast amounts of data being collected online through Learning Management Systems (LMS) and Massive Open Online Courses (MOOCs).

With this influx of educational data, AI techniques such as reinforcement learning can also be utilized to empower ed-tech. Such algorithms perform best with large amounts of data, which as of 2021 only a few ed-tech companies possessed. These algorithms have achieved breakthrough performance in multiple domains, including games [ 66 ], healthcare [ 14 ] and robotics [ 34 ]. This presents a great opportunity for AI applications in education that further enhance students’ learning outcomes, reduce teachers’ workloads [ 30 ] and make learning personalized [ 64 ], interactive and fun [ 50 , 53 ] for teachers and students.
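
As a simplified illustration of the kind of algorithm this opportunity points to, the sketch below uses an epsilon-greedy bandit, one of the simplest reinforcement learning methods, to learn which activity type yields the highest measured learning gain for a learner. The activity names, the reward definition and the simulated gain probabilities are assumptions made purely for illustration.

```python
import random

activities = ["video", "quiz", "worked_example"]
value = {a: 0.0 for a in activities}   # running estimate of learning gain
count = {a: 0 for a in activities}

def choose(eps=0.1):
    if random.random() < eps:                   # explore occasionally
        return random.choice(activities)
    return max(activities, key=value.get)       # otherwise exploit best so far

def update(activity, reward):
    count[activity] += 1
    value[activity] += (reward - value[activity]) / count[activity]

# Simulated interaction loop: reward = 1 if a learning gain was observed.
true_gain = {"video": 0.3, "quiz": 0.6, "worked_example": 0.5}
for _ in range(500):
    a = choose()
    update(a, 1.0 if random.random() < true_gain[a] else 0.0)
print(max(activities, key=value.get))   # almost always 'quiz', the best arm
```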

With a growing number of AI-powered ed-tech products in the future, there will also be much more research on ethical AIEd. The risks of AI going wrong in education, and the psychological impact this can have on learners and teachers, are huge. Hence, more work needs to be done to ensure robust and safe AI products for all stakeholders.

This can begin with ed-tech companies sharing detailed guidelines for using AI-powered ed-tech products, particularly specifying when not to rely on them. This includes detailed documentation of the entire machine learning development pipeline: the assumptions made, the data processing approaches used, and the processes followed for selecting machine learning models. Regulators can play a very important role in ensuring that certain ethical principles are followed in developing these AI products, or that the products achieve certain minimum performance thresholds [ 32 ].

6 Conclusion

AIEd promised a lot in its infancy around three decades ago. However, a number of AI breakthroughs are still required to see that kind of disruption in education at scale (including basic infrastructure). In the end, the goal of AIEd is not to promote AI, but to support education. In essence, there is only one way to evaluate the impact of AI in Education: through learning outcomes. AIEd for reducing teachers’ workload is far more impactful if the reduced workload enables teachers to focus on students’ learning, leading to better learning outcomes.

Cutting-edge AI built by researchers and companies around the world is not of much use if it does not help a primary-grade student learn. This problem becomes extremely challenging because every learner is unique, with different learning pathways. With recent developments in AI, particularly reinforcement learning techniques, the future holds exciting possibilities for where AI will take education. For impactful AI in education, learners and teachers always need to be at the epicenter of AI development.

About Amazon.: Helping 700,000 students transition to remote learning. https://www.aboutamazon.com/news/community/helping-700-000-students-transition-to-remote-learning (2020)

Al-Maroof, R.A.S., Al-Emran, M.: Students acceptance of google classroom: an exploratory study using PLS–SEM approach. Int. J. Emerg. Technol Learn. (2018). https://doi.org/10.3991/ijet.v13i06.8275


Amazon Web Services, Inc. (n.d.).: Amazon Web Services, Inc. https://pages.awscloud.com/whitepaper-emerging-trends-in-education.html (2020)

Baker, R.S.: Stupid tutoring systems, intelligent humans. Int. J. Artif. Intell. Educ. 26 (2), 600–614 (2016)

Baturay, M.H.: An overview of the world of MOOCs. Procedia. Soc. Behav. Sci. 174 , 427–433 (2015)

Baylari, A., Montazer, G.A.: Design a personalized e-learning system based on item response theory and artificial neural network approach. Expert. Syst. Appl. 36 (4), 8013–8021 (2009)

Beck, J., Stern, M., Haugsjaa, E.: Applications of AI in education. Crossroads 3 (1), 11–15 (1996). https://doi.org/10.1016/j.eswa.2008.10.080

Beck, J.E.: Modeling the Student with Reinforcement Learning. Proceedings of the Machine learning for User Modeling Workshop at the Sixth International Conference on User Modeling (1997)

Beck, J.E., Woolf, B.P., Beal, C.R.: ADVISOR: A machine learning architecture for intelligent tutor construction. Proceedings of the 7th National Conference on Artificial Intelligence, New York, ACM, 552–557 (2000)

Boekaerts, M.: Self-regulated learning: where we are today. Int. J. Educ. Res. 31 (6), 445–457 (1999)

Brundage, M., Avin, S., Wang, J., Belfield, H., Krueger, G., Hadfield, G., Maharaj, T.: Toward trustworthy AI development: mechanisms for supporting verifiable claims. arXiv preprint arXiv:2004.07213 (2020)

Bull, S., Kay, J.: Open learner models. In: Nkambou, R., Bourdeau, J., Mizoguchi, R. (eds.) Studies in computational intelligence, pp. 301–322. Springer, Berlin (2010)


Cunha-Perez, C., Arevalillo-Herraez, M., Marco-Gimenez, L., Arnau, D.: On incorporating affective support to an intelligent tutoring system: an empirical study. IEEE. R. Iberoamericana. De. Tecnologias. Del. Aprendizaje. 13 (2), 63–69 (2018)

Callaway, E.: “It will change everything”: DeepMind’s AI makes gigantic leap in solving protein structures. Nature. https://www.nature.com/articles/d41586-020-03348-4 . (2020)

Cambridge University Press and Educate Ventures. Shock to the system: lessons from Covid-19 Volume 1: Implications and recommendations. https://www.cambridge.org/pk/files/1616/1349/4545/Shock_to_the_System_Lessons_from_Covid19_Volume_1.pdf (2021). Accessed 12 Apr 2021

Deng, R., Benckendorff, P., Gannaway, D.: Progress and new directions for teaching and learning in MOOCs. Comput. Educ. 129 , 48–60 (2019)

Erümit, A.K., Çetin, İ: Design framework of adaptive intelligent tutoring systems. Educ. Inf. Technol. 25 (5), 4477–4500 (2020)

Fang, Y., Ren, Z., Hu, X., Graesser, A.C.: A meta-analysis of the effectiveness of ALEKS on learning. Educ. Psychol. 39 (10), 1278–1292 (2019)

Floridi, L.: Soft ethics, the governance of the digital and the general data protection regulation. Philos. Trans. R. Soc. A. Math. Phys. Eng. Sci. 376 (2133), 20180081 (2018)

Goldstein, I.J.: The genetic graph: a representation for the evolution of procedural knowledge. Int. J. Man. Mach. Stud. 11 (1), 51–77 (1979)

Goryachikh, S.P., Sozinova, A.A., Grishina, E.N., Nagovitsyna, E.V.: Optimisation of the mechanisms of managing venture investments in the sphere of digital education on the basis of new information and communication technologies: audit and reorganisation. IJEPEE. 13 (6), 587–594 (2020)

Grawemeyer, B., Gutierrez-Santos, S., Holmes, W., Mavrikis, M., Rummel, N., Mazziotti, C., Janning, R.: Talk, tutor, explore, learn: intelligent tutoring and exploration for robust learning, p. 2015. AIED, Madrid (2015)

Hansen, A., Mavrikis, M.: Learning mathematics from multiple representations: two design principles. ICTMT-12, Faro (2015)

Hasan, M.A., Noor, N.F.M., Rahman, S.S.A., Rahman, M.M.: The transition from intelligent to affective tutoring system: a review and open issues. IEEE Access (2020). https://doi.org/10.1109/ACCESS.2020.3036990

Heffernan, N.T., Heffernan, C.L.: The ASSISTments ecosystem: building a platform that brings scientists and teachers together for minimally invasive research on human learning and teaching. Int. J. Artif. Intell. Educ. (2014). https://doi.org/10.1007/s40593-014-0024-x

Heffernan, N.T., Koedinger, K.R.: An intelligent tutoring system incorporating a model of an experienced human tutor. Proceedings of the 6th International Conference on Intelligent Tutoring Systems, 2363, p 596–608, (2002)

Hill, P., Barber, M.: Preparing for a Renaissance in Assessment. Pearson, London (2014)

Hollands, F.M., Tirthali, D.: Why do institutions offer MOOCs? Online Learning 18 (3), 3 (2014)

Iftakhar, S.: Google classroom: what works and how. J. Educ. Soc. Sci. 3 (1), 12–18 (2016)

Iglesias, A., Martínez, P., Aler, R., Fernández, F.: Reinforcement learning of pedagogical policies in adaptive and intelligent educational systems. Knowl. Based. Syst. 22 (4), 266–270 (2009)

Johnson, D.G., Verdicchio, M.: AI, agency and responsibility: the VW fraud case and beyond. Ai. Soc. 34 (3), 639–647 (2019)

Kazim, E., Denny, D.M.T., Koshiyama, A.: AI auditing and impact assessment: according to the UK information commissioner’s office. AI. Ethics. 1 , 1–10 (2021)

Kazim, E., Koshiyama, A.: A High-Level Overview of AI Ethics. SSRN J (2020). https://doi.org/10.2139/ssrn.3609292

Kober, J., Bagnell, J.A., Peters, J.: Reinforcement learning in robotics: a survey. Int. J. Robot. Res. 32 (11), 1238–1274 (2013)

Koshiyama, A., Kazim, E., Treleaven, P., Rai, P., Szpruch, L., Pavey, G., Ahamat, G., Leutner, F., Goebel, R., Knight, A., Adams, J., Hitrova, C., Barnett, J., Nachev, P., Barber, D., Chamorro-Premuzic, T., Klemmer, K., Gregorovic, M., Khan, S., Lomas, E.: Towards algorithm auditing a survey on managing legal ethical and technological risks of AI, ML and associated algorithms. SSRN J (2021). https://doi.org/10.2139/ssrn.3778998

LaPierre, J.: How AI Enhances Collaborative Learning. Filament Games (2018). https://www.filamentgames.com/blog/how-ai-enhances-collaborative-learning/ . Accessed 12 Apr 2021

Luckin, R.: Towards artificial intelligence-based assessment systems. Nat. Hum. Behav. (2017). https://doi.org/10.1038/s41562-016-0028

Luckin, R., du Boulay, B.: Int. J. Artif. Intell. Educ. 26 , 416–430 (2016)

Luckin, R., Holmes, W., Griffiths, M., Pearson, L.: Intelligence Unleashed An argument for AI in Education. https://static.googleusercontent.com/media/edu.google.com/en//pdfs/Intelligence-Unleashed-Publication.pdf (2016)

Barron-Estrada M.L., Zatarain-Cabada, R., Oramas-Bustillos, R., Gonzalez-Hernandez, F.: Sentiment analysis in an affective intelligent tutoring system. Proc. IEEE 17th Int. Conf. Adv. Learn. Technol. (ICALT), Timisoara pp. 394–397 2017.

Ma, W., Adesope, O., Nesbit, J.C., Liu, Q.: Intelligent tutoring systems and learning outcomes: a meta-analysis. J. Educ. Psychol. 106 (4), 901–918 (2014)

Makridakis, S.: The forthcoming Artificial Intelligence (AI) revolution: Its impact on society and firms. Futures 90 , 46–60 (2017)

Mavrikis, M.: Int. J. Artif. Intell. Tools. 19 , 733–753 (2010)

Merrill, D.C., Reiser, B.J., Ranney, M., Trafton, J.G.: Effective tutoring techniques: a comparison of human tutors and intelligent tutoring systems. J. Learn. Sci. 2 (3), 277–305 (1992)

Moeini, A.: Theorising Evidence-Informed Learning Technology Enterprises: A Participatory Design-Based Research Approach. Doctoral dissertation, UCL University College London, London, (2020)

Mohamed, H., Lamia, M.: Implementing flipped classroom that used an intelligent tutoring system into learning process. Comput. Educ. 124 , 62–76 (2018). https://doi.org/10.1016/j.compedu.2018.05.011

Mueller, B.: The Artificial Intelligence Act: A Quick Explainer. [online] Center for Data Innovation (2021). https://datainnovation.org/2021/05/the-artificial-intelligence-act-a-quick-explainer/ . Accessed 12 Apr 2021

Murray, M.C., Pérez, J.: Informing and performing: A study comparing adaptive learning to traditional learning. Inform. Sci. J. 18 , 111–125 (2015)

Oudeyer, P-Y.: Computational Theories of Curiosity-Driven Learning. https://arxiv.org/pdf/1802.10546.pdf (2018)

Park, H.W., Grover, I., Spaulding, S., Gomez, L., Breazeal, C.: A model-free affective reinforcement learning approach to personalization of an autonomous social robot companion for early literacy education. AAAI. 33 (1), 687–694 (2019)

Resnick, M., Robinson, K.: Lifelong kindergarten: cultivating creativity through projects, passion, peers, and play. MIT press, Cambridge (2017)


Rodríguez-Triana, M.J., Prieto, L.P., Martínez-Monés, A., Asensio-Pérez, J.I. and Dimitriadis, Y.: The teacher in the loop: Customizing multimodal learning analytics for blended learning. In Proceedings of the 8th international conference on learning analytics and knowledge. pp 417–426 (2018)

Rowe, J.P., Lester, J.C.: Improving student problem solving in narrative-centered learning environments: a modular reinforcement learning framework. In International Conference on Artificial Intelligence in Education. pp. 419–428. Springer, Cham (2015)

Russell, S.J., Norvig, P., Davis, E.: Artificial intelligence: a modern approach. Prentice Hall, Upper Saddle River (2010)


Jiménez, S., Juárez-Ramírez, R., Castillo, V.H., Licea, G., Ramírez-Noriega, A., Inzunza, S.: A feedback system to provide affective support to students. Comput. Appl. Eng. Educ. 26 (3), 473–483 (2018)

Sadler, D.R.: Formative assessment in the design of instructional systems. Instr. Sci. 18 , 119–144 (1989)

Samarakou, M., Fylladitakis, E., Prentakis, P., Athineos, S.: Implementation of artificial intelligence assessment in engineering laboratory education. https://files.eric.ed.gov/fulltext/ED557263.pdf (2014). Accessed 24 Feb 2021

Segal, A., Hindi, S., Prusak, N., Swidan, O., Livni, A., Palatnic, A., Schwarz, B.: Keeping the teacher in the loop: Technologies for monitoring group learning in real-time. In International Conference on Artificial Intelligence in Education. pp. 64–76. Springer, Cham (2017)

Self, J. A. (1990). Theoretical foundations of intelligent tutoring systems. J. Artif. Intell

Self, J.A.: The defining characteristics of intelligent tutoring systems research: ITSs care, precisely. IJAIEd. 10 , 350–364 (1998)

Selwood, I., Pilkington, R.: Teacher workload: using ICT to release time to teach. Educ. Rev. 57 (2), 163–174 (2005)

Shaharanee, I.N.M., Jamil, J.M., Rodzi, S.S.M.: Google classroom as a tool for active learning. AIP Conference Proceedings, 1761 (1), pp. 020069, AIP Publishing LLC, College Park (2016)

Shaharanee, I.N.M., Jamil, J.M., Rodzi, S.S.M.: The application of Google Classroom as a tool for teaching and learning. J. Telecommun. Electron. Comp. Eng. 8 (10), 5–8 (2016)

Shawky, D., Badawi, A.: Towards a personalized learning experience using reinforcement learning. In: Hassanien, A.E. (ed.) Machine learning paradigms Theory and application, pp. 169–187. Springer (2019)

Shute, V.J. (1991). Rose garden promises of intelligent tutoring systems: blossom or thorn.  NASA, Lyndon B. Johnson Space Center, Fourth Annual Workshop on Space Operations Applications and Research (SOAR 90) . Available at: https://ntrs.nasa.gov/citations/19910011382 . Accessed 4 July 2021

Silver, D., Huang, A., Maddison, C.J., Guez, A., Sifre, L., Van Den Driessche, G., Schrittwieser, J., Antonoglou, I., Panneershelvam, V., Lanctot, M., Dieleman, S.: Mastering the game of Go with deep neural networks and tree search. Nature 529 (7587), 484–489 (2016)

Skitka, L.J., Mosier, K., Burdick, M.D.: Accountability and automation bias. Int. J. Hum. Comput. Stud. 52 (4), 701–717 (2000)

Steenbergen-Hu, S., Cooper, H.: A meta-analysis of the effectiveness of intelligent tutoring systems on K–12 students’ mathematical learning. J. Educ. Psychol. 105 (4), 970–987 (2013)

Sudarsana, I.K., Putra, I.B., Astawa, I.N.T., Yogantara, I.W.L.: The use of google classroom in the learning process. J. Phys. Conf. Ser 1175 (1), 012165 (2019)

TechCrunch. Indian education startup Byju’s is fundraising at a $10B valuation. https://techcrunch.com/2020/05/01/indian-education-startup-byjus-is-fundraising-at-a-10b-valuation/ (2020). Accessed 12 Apr 2021

The Institute for Ethical AI in Education The Ethical Framework for AI in Education (IEAIED). https://fb77c667c4d6e21c1e06.b-cdn.net/wp-content/uploads/2021/03/The-Ethical-Framework-for-AI-in-Education-Institute-for-Ethical-AI-in-Education-Final-Report.pdf (2021). Accessed 12 Apr 2021

The Institute for Ethical AI in Education The Ethical Framework for AI in Education (n.d.). Available at: https://www.buckingham.ac.uk/wp-content/uploads/2021/03/The-Institute-for-Ethical-AI-in-Education-The-Ethical-Framework-for-AI-in-Education.pdf . Accessed 4 July 2021

Tisseron, S., Tordo, F., Baddoura, R.: Testing Empathy with Robots: a model in four dimensions and sixteen ítems. Int. J. Soc. Robot. 7 (1), 97–102 (2015)

UNESCO. Artificial intelligence in education. UNESCO. https://en.unesco.org/artificial-intelligence/education . (2019). Accessed 12 Apr 2021

Utterberg Modén, M., Tallvid, M., Lundin, J., Lindström, B.: Intelligent Tutoring Systems: Why Teachers Abandoned a Technology Aimed at Automating Teaching Processes. In: Proceedings of the 54th Hawaii International Conference on System Sciences, Maui, p. 1538 (2021)

van der Spoel, I., Noroozi, O., Schuurink, E., van Ginkel, S.: Teachers’ online teaching expectations and experiences during the Covid19-pandemic in the Netherlands. Eur. J. Teach. Educ. 43 (4), 623–638 (2020)

Weller, M.: Twenty years of EdTech. Educa. Rev. Online. 53 (4), 34–48 (2018)

Wenger, E.: Artificial intelligence and tutoring systems. Morgan Kauffman, Los Altos (1987)

World Economic Forum and The Boston Consulting Group. New vision for education unlocking the potential of technology industry agenda prepared in collaboration with the Boston consulting group. http://www3.weforum.org/docs/WEFUSA_NewVisionforEducation_Report2015.pdf (2015). Accessed 12 Apr 2021

Yampolskiy, R.V., Spellchecker, M.S.: Artificial intelligence safety and cybersecurity: a timeline of AI failures. arXiv:1610.07997 (2016)

Yu, H., Miao, C., Leung, C., White, T.J.: Towards AI-powered personalization in MOOC learning. Npj. Sci. Learn. 2 (1), 1–5 (2017)

Yu, H., Shen, Z., Miao, C., Leung, C., Lesser, V.R., Yang, Q.: Building ethics into artificial intelligence. arXiv:1812.02953 (2018)

Zemel, R., Wu Y., Swersky, K., Pitassi, T., Dwork, C.: Learning fair representations. In: International Conference on Machine Learning, pp. 325–333 (2013)

Zhang, Y., Liao, Q.V., Bellamy, R.K.E.: Effect of confidence and explanation on accuracy and trust calibration in AI-assisted decision making. In: Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency. arXiv:2001.02114 (2020)

Zimmerman, B.J., Schunk, D.H.: Handbook of Self-Regulation of Learning and Performance. Routledge, Oxfordshire (2011)


Author information

Authors and Affiliations

Artificial Intelligence at University College London, London, UK

Muhammad Ali Chaudhry

Department of Computer Science, University College London, London, UK

Emre Kazim

Corresponding author

Correspondence to Muhammad Ali Chaudhry .

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ .


About this article

Chaudhry, M.A., Kazim, E.: Artificial Intelligence in Education (AIEd): a high-level academic and industry note 2021. AI Ethics 2, 157–165 (2022). https://doi.org/10.1007/s43681-021-00074-z


Received: 25 April 2021

Accepted: 17 June 2021

Published: 07 July 2021

Issue Date: February 2022

DOI: https://doi.org/10.1007/s43681-021-00074-z


  • Artificial Intelligence
  • Machine learning
  • Learning science
  • Artificial Intelligence in Education (AIEd)
  • Intelligent Tutoring Systems (ITS)



Artificial Intelligence Topics for Dissertations

Published by Carmen Troy on January 6th, 2023; revised on August 16, 2023

Introduction

Artificial intelligence (AI) is the discipline of building machines, robots, and software that are intelligent enough to act like humans. With artificial intelligence, the world is moving toward a future where machines are as knowledgeable as humans and can work and act as we do.

Once fully developed, AI-powered machines will replace humans in many fields. But would that take power away from humans? Would people suffer as these machines become intelligent enough to carry out daily tasks and routine work? Will AI wreak havoc in the years to come? These are questions that can only be answered through thorough research.

To understand how powerful AI machines will be in the future and what sort of world we will witness, here are the best AI topics you can choose for your dissertation.

You may also want to start your dissertation by requesting a brief research proposal from our writers on any of these topics, which includes an introduction to the topic, research question, aim and objectives, and literature review, along with the proposed methodology of research to be conducted. Let us know if you need any help getting started.

Check our dissertation examples to get an idea of how to structure your dissertation.

Review the full list of dissertation topics for 2022 here.

You may also be interested in technology dissertation topics, computer engineering dissertation topics, networking dissertation topics, and data security dissertation topics.

2022 Artificial Intelligence Topics for Dissertations

Topic 1: Artificial Intelligence (AI) and Supply Chain Management - An Assessment of the Present and Future Role Played by AI in the Supply Chain Process: A Case of IBM Corporation in the US

Research Aim: This research aims to identify the present and future roles AI plays in supply chain management. It will analyze how AI affects various components of the supply chain process, such as procurement and distribution. It will use the case study of IBM Corporation, which uses AI in the US to make the supply chain process more efficient and reduce losses. Moreover, through various technological and business frameworks, it will recommend changes to current AI-based supply chain models to improve their efficiency.

Topic 2: Artificial Intelligence (AI) and Blockchain Technology: A Transition Towards Decentralized and Automated Finance - A Study to Find the Role of AI and Blockchains in Making Various Segments of the Financial Sector Automated and Decentralized

Research Aim: This study will analyze the role of AI and blockchains in making various segments of financial markets (banking, insurance, investment, the stock market, etc.) automated and decentralized. It will examine how AI and blockchains can eliminate the role of intermediaries and commission-charging players, such as large banks and corporations, to make the economy and financial system more efficient and cheaper. Therefore, it will study applications of various AI and blockchain models to show how they can affect economic governance.

Topic 3: AI and Healthcare - A Comparative Analysis of Machine Learning (ML) and Deep Learning (DL) Models for Cancer Diagnosis

Research Aim: This study aims to identify the role of AI in modern healthcare. It will analyze the efficacy of contemporary ML and DL models for cancer diagnosis. It will examine how these models diagnose cancer, whether ML or DL does it better, and by how much. Moreover, it will discuss criticisms of these models and ways to improve them for better results.

Topic 4: Are AI and Big Data Analytics New Tools for Digital Innovation? An Assessment of Available Blockchain and Data Analytics Tools for Startup Development

Research Aim: This study aims to assess the role of present-day AI and data analytics tools in startup development. It will identify how modern startups use these technologies in their development stages to innovate and increase their effectiveness. Moreover, it will analyze the macroeconomic effects by examining their role in speeding up startup culture, creating more employment, and raising incomes.

Topic 5: The Role of AI and Robotics in Economic Growth and Development- A Case of Emerging Economies

Research Aim: This study aims to determine the impact of AI and robotics on economic growth and development in emerging economies. It will identify how AI and robotics speed up production and other business-related processes in emerging economies, create more employment, and raise aggregate income levels. Moreover, it will examine how this leads to innovation and growing attention to modern skills such as web development, data analytics, and data science. Lastly, it will use two or three emerging countries as case studies for the analysis.

Artificial Intelligence Topics

Topic 1: Machine learning and artificial intelligence in next-generation wearable devices

Research Aim: This study will aim to understand the role of machine learning and big data in the future of wearables. The research will focus on how an individual’s health and well-being can be improved with devices powered by AI. The study will first examine the concept of ML and its implications in various fields. It will then narrow down to the role of machine learning in the future of wearable devices and how it can help individuals improve their daily routine and lifestyle and move towards a better and healthier life. The research will conclude with how ML will shape the future of wearables and help people improve their well-being.

Topic 2: Automation, machine learning and artificial intelligence in the field of medicine

Research Aim: Machine learning and artificial intelligence play a huge role in the field of medicine. From diagnosis to treatment, artificial intelligence plays a crucial role in the healthcare industry today. This study will highlight how machine learning and automation can help doctors provide the right treatment to patients at the right time. With AI-powered machines, advanced diagnostic tests are being introduced to detect diseases well before their onset. Moreover, AI is also helping to develop drugs at a faster pace and to personalise treatment. All these aspects will be discussed in this study with relevant case studies.

Topic 3: Robotics and artificial intelligence – Assessing the Impact on business and economics

Research Aim: Businesses are changing the way they work due to technological advancements. Robotics and artificial intelligence have paved the way for new technologies and new methods of working. Many people argue that the introduction of robotics and AI will adversely impact humans, as many of them might be replaced by AI-powered machines. While this cannot be denied, this artificial intelligence research topic will aim to understand how much businesses will be impacted by these new technologies and to assess the future of robotics and artificial intelligence in different industries.

Topic 4: Artificial intelligence governance: Ethical, legal and social challenges

Research Aim: With artificial intelligence taking over the world, many people have reservations over the technology tracking people and their activities 24/7. They have called for strict governance for these intelligent systems and demanded that this technology be fair and transparent. This research will address these issues and present the ethical, legal, and social challenges governing AI-powered systems. The study will be qualitative in nature and will talk about the various ways through which artificial intelligence systems can be governed. It will also address the challenges that will hinder fair and transparent governance.

Topic 5: Will quantum computing improve artificial intelligence? An analysis

Research Aim: Quantum computing (QC) is set to revolutionize the field of artificial intelligence. According to experts, quantum computing combined with artificial intelligence will change medicine, business, and the economy. This research will first introduce the concept of quantum computing and will explain how powerful it is. The study will then talk about how quantum computing will change and help increase the efficiency of artificially intelligent systems. Examples of algorithms that quantum computing utilises will also be presented to help explain how this field of computer science will help improve artificial intelligence.

Topic 6: The role of deep learning in building intelligent systems

Research Aim: Deep learning, an essential branch of artificial intelligence, utilizes neural networks that process information in a way loosely analogous to the human nervous system. This research will introduce the concept of deep learning and discuss how it works in artificial intelligence. Deep learning algorithms will also be explored in this study to build a deeper understanding of this artificial intelligence topic. Using case examples and evidence, the research will explore how deep learning assists in creating machines that are intelligent and how they can process information like a human being. The various applications of deep learning will also be discussed, and a small illustrative sketch follows below.
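
To ground the idea, here is a minimal sketch of a two-layer neural network forward pass in NumPy; the layer sizes, weights, and input are illustrative assumptions, not part of any proposed dissertation.

    import numpy as np

    def relu(x):
        # Rectified linear unit: the standard nonlinearity between layers.
        return np.maximum(0.0, x)

    def forward(x, w1, b1, w2, b2):
        # Two-layer network: input -> hidden representation -> output scores.
        hidden = relu(x @ w1 + b1)
        return hidden @ w2 + b2

    rng = np.random.default_rng(0)
    x = rng.normal(size=(1, 4))               # one example with 4 made-up features
    w1, b1 = rng.normal(size=(4, 8)), np.zeros(8)
    w2, b2 = rng.normal(size=(8, 2)), np.zeros(2)
    print(forward(x, w1, b1, w2, b2))         # raw scores for 2 illustrative classes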

Topic 7: Evaluating the role of natural language processing in artificial intelligence

Research Aim: Natural language processing (NLP) is an essential element of artificial intelligence. It provides systems and machines with the ability to read, understand, and interpret human language. With the help of natural language processing, systems can even measure sentiment and predict which parts of human language are important. This research will aim to evaluate the role of this field in artificial intelligence. It will further assist in understanding how natural language processing helps build intelligent systems that various organizations can use. Furthermore, the various applications of NLP will also be discussed.

Topic 8: Application of computer vision in building intelligent systems

Research Aim: Computer vision in the field of artificial intelligence makes systems smart enough to analyze and understand images. These machines then derive intelligence from the images that have been fed to the system. This research will first aim to understand computer vision and its role in artificial intelligence. A framework will be presented that explains the workings of computer vision in artificial intelligence. This study will present the applications of computer vision to further clarify how artificial intelligence uses computer vision to build smart systems.

Topic 9: Analysing the use of the IoT in artificial intelligence

Research Aim: The Internet of Things (IoT) and artificial intelligence are two separate, powerful tools. IoT can connect devices wirelessly so that they perform a set of actions without human intervention. When this powerful tool is combined with artificial intelligence, systems become powerful enough to simulate human behaviour and make decisions without human interference. This artificial intelligence topic will aim to analyze the use of the Internet of Things in artificial intelligence. Machines that use IoT and AI will be analyzed, and the study will present how human behaviour is simulated so accurately.

Topic 10: Recommender systems – exploring its power in e-commerce

Research Aim: Recommender systems use algorithms to offer relevant suggestions to users, whether for a product, a service, a search result, or a movie or TV series. Users receive scores of recommendations after searching for a particular product or browsing their favourite shows. With the help of AI, recommender systems can offer relevant and accurate suggestions to users. The main aim of this research will be to explore the use of recommender systems in e-commerce. Industry giants use this tool to help customers find the product or service they are looking for and make the right decision. This research will discuss where recommender systems are used, how they are implemented, and the results they deliver for e-commerce businesses; a toy illustration follows below.
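
As a toy illustration of the item-similarity idea behind many recommender systems, the sketch below scores unrated items by cosine similarity to items a user rated highly; the rating matrix and the "liked" threshold are made-up assumptions, not a production design.

    import numpy as np

    # Rows are users, columns are items; entries are made-up ratings (0 = unrated).
    ratings = np.array([
        [5, 4, 0, 1],
        [4, 5, 1, 0],
        [1, 0, 5, 4],
        [0, 1, 4, 5],
    ], dtype=float)

    def cosine_sim(a, b):
        # Cosine similarity between two item rating vectors.
        return a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-9)

    def recommend(user, k=2):
        # Score each unrated item by its similarity to the items the user liked.
        scores = np.zeros(ratings.shape[1])
        for liked in np.where(ratings[user] >= 4)[0]:
            for item in range(ratings.shape[1]):
                if ratings[user, item] == 0:
                    scores[item] += cosine_sim(ratings[:, liked], ratings[:, item])
        return np.argsort(scores)[::-1][:k]

    print(recommend(0))  # top suggestions for user 0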


Frequently Asked Questions

How do I find artificial intelligence dissertation topics?

To find artificial intelligence dissertation topics:

  • Study recent AI advancements.
  • Explore ethical concerns.
  • Investigate AI in specific industries.
  • Analyze AI’s societal impact.
  • Consider human-AI interaction.
  • Select a topic aligning with your expertise and passion.



Machine Learning - CMU

PhD Dissertations

[All are .pdf files.]

Learning Models that Match Jacob Tyo, 2024

Improving Human Integration across the Machine Learning Pipeline Charvi Rastogi, 2024

Reliable and Practical Machine Learning for Dynamic Healthcare Settings Helen Zhou, 2023

Automatic customization of large-scale spiking network models to neuronal population activity (unavailable) Shenghao Wu, 2023

Estimation of BVk functions from scattered data (unavailable) Addison J. Hu, 2023

Rethinking object categorization in computer vision (unavailable) Jayanth Koushik, 2023

Advances in Statistical Gene Networks Jinjin Tian, 2023

Post-hoc calibration without distributional assumptions Chirag Gupta, 2023

The Role of Noise, Proxies, and Dynamics in Algorithmic Fairness Nil-Jana Akpinar, 2023

Collaborative learning by leveraging siloed data Sebastian Caldas, 2023

Modeling Epidemiological Time Series Aaron Rumack, 2023

Human-Centered Machine Learning: A Statistical and Algorithmic Perspective Leqi Liu, 2023

Uncertainty Quantification under Distribution Shifts Aleksandr Podkopaev, 2023

Probabilistic Reinforcement Learning: Using Data to Define Desired Outcomes, and Inferring How to Get There Benjamin Eysenbach, 2023

Comparing Forecasters and Abstaining Classifiers Yo Joong Choe, 2023

Using Task Driven Methods to Uncover Representations of Human Vision and Semantics Aria Yuan Wang, 2023

Data-driven Decisions - An Anomaly Detection Perspective Shubhranshu Shekhar, 2023

Applied Mathematics of the Future Kin G. Olivares, 2023

Methods and Applications of Explainable Machine Learning Joon Sik Kim, 2023

Neural Reasoning for Question Answering Haitian Sun, 2023

Principled Machine Learning for Societally Consequential Decision Making Amanda Coston, 2023

Long term brain dynamics extend cognitive neuroscience to timescales relevant for health and physiology Maxwell B. Wang, 2023

Long term brain dynamics extend cognitive neuroscience to timescales relevant for health and physiology Darby M. Losey, 2023

Calibrated Conditional Density Models and Predictive Inference via Local Diagnostics David Zhao, 2023

Towards an Application-based Pipeline for Explainability Gregory Plumb, 2022

Objective Criteria for Explainable Machine Learning Chih-Kuan Yeh, 2022

Making Scientific Peer Review Scientific Ivan Stelmakh, 2022

Facets of regularization in high-dimensional learning: Cross-validation, risk monotonization, and model complexity Pratik Patil, 2022

Active Robot Perception using Programmable Light Curtains Siddharth Ancha, 2022

Strategies for Black-Box and Multi-Objective Optimization Biswajit Paria, 2022

Unifying State and Policy-Level Explanations for Reinforcement Learning Nicholay Topin, 2022

Sensor Fusion Frameworks for Nowcasting Maria Jahja, 2022

Equilibrium Approaches to Modern Deep Learning Shaojie Bai, 2022

Towards General Natural Language Understanding with Probabilistic Worldbuilding Abulhair Saparov, 2022

Applications of Point Process Modeling to Spiking Neurons (Unavailable) Yu Chen, 2021

Neural variability: structure, sources, control, and data augmentation Akash Umakantha, 2021

Structure and time course of neural population activity during learning Jay Hennig, 2021

Cross-view Learning with Limited Supervision Yao-Hung Hubert Tsai, 2021

Meta Reinforcement Learning through Memory Emilio Parisotto, 2021

Learning Embodied Agents with Scalably-Supervised Reinforcement Learning Lisa Lee, 2021

Learning to Predict and Make Decisions under Distribution Shift Yifan Wu, 2021

Statistical Game Theory Arun Sai Suggala, 2021

Towards Knowledge-capable AI: Agents that See, Speak, Act and Know Kenneth Marino, 2021

Learning and Reasoning with Fast Semidefinite Programming and Mixing Methods Po-Wei Wang, 2021

Bridging Language in Machines with Language in the Brain Mariya Toneva, 2021

Curriculum Learning Otilia Stretcu, 2021

Principles of Learning in Multitask Settings: A Probabilistic Perspective Maruan Al-Shedivat, 2021

Towards Robust and Resilient Machine Learning Adarsh Prasad, 2021

Towards Training AI Agents with All Types of Experiences: A Unified ML Formalism Zhiting Hu, 2021

Building Intelligent Autonomous Navigation Agents Devendra Chaplot, 2021

Learning to See by Moving: Self-supervising 3D Scene Representations for Perception, Control, and Visual Reasoning Hsiao-Yu Fish Tung, 2021

Statistical Astrophysics: From Extrasolar Planets to the Large-scale Structure of the Universe Collin Politsch, 2020

Causal Inference with Complex Data Structures and Non-Standard Effects Kwhangho Kim, 2020

Networks, Point Processes, and Networks of Point Processes Neil Spencer, 2020

Dissecting neural variability using population recordings, network models, and neurofeedback (Unavailable) Ryan Williamson, 2020

Predicting Health and Safety: Essays in Machine Learning for Decision Support in the Public Sector Dylan Fitzpatrick, 2020

Towards a Unified Framework for Learning and Reasoning Han Zhao, 2020

Learning DAGs with Continuous Optimization Xun Zheng, 2020

Machine Learning and Multiagent Preferences Ritesh Noothigattu, 2020

Learning and Decision Making from Diverse Forms of Information Yichong Xu, 2020

Towards Data-Efficient Machine Learning Qizhe Xie, 2020

Change modeling for understanding our world and the counterfactual one(s) William Herlands, 2020

Machine Learning in High-Stakes Settings: Risks and Opportunities Maria De-Arteaga, 2020

Data Decomposition for Constrained Visual Learning Calvin Murdock, 2020

Structured Sparse Regression Methods for Learning from High-Dimensional Genomic Data Micol Marchetti-Bowick, 2020

Towards Efficient Automated Machine Learning Liam Li, 2020

Learning Collections of Functions Emmanouil Antonios Platanios, 2020

Provable, structured, and efficient methods for robustness of deep networks to adversarial examples Eric Wong, 2020

Reconstructing and Mining Signals: Algorithms and Applications Hyun Ah Song, 2020

Probabilistic Single Cell Lineage Tracing Chieh Lin, 2020

Graphical network modeling of phase coupling in brain activity (unavailable) Josue Orellana, 2019

Strategic Exploration in Reinforcement Learning - New Algorithms and Learning Guarantees Christoph Dann, 2019

Learning Generative Models using Transformations Chun-Liang Li, 2019

Estimating Probability Distributions and their Properties Shashank Singh, 2019

Post-Inference Methods for Scalable Probabilistic Modeling and Sequential Decision Making Willie Neiswanger, 2019

Accelerating Text-as-Data Research in Computational Social Science Dallas Card, 2019

Multi-view Relationships for Analytics and Inference Eric Lei, 2019

Information flow in networks based on nonstationary multivariate neural recordings Natalie Klein, 2019

Competitive Analysis for Machine Learning & Data Science Michael Spece, 2019

The When, Where and Why of Human Memory Retrieval Qiong Zhang, 2019

Towards Effective and Efficient Learning at Scale Adams Wei Yu, 2019

Towards Literate Artificial Intelligence Mrinmaya Sachan, 2019

Learning Gene Networks Underlying Clinical Phenotypes Under SNP Perturbations From Genome-Wide Data Calvin McCarter, 2019

Unified Models for Dynamical Systems Carlton Downey, 2019

Anytime Prediction and Learning for the Balance between Computation and Accuracy Hanzhang Hu, 2019

Statistical and Computational Properties of Some "User-Friendly" Methods for High-Dimensional Estimation Alnur Ali, 2019

Nonparametric Methods with Total Variation Type Regularization Veeranjaneyulu Sadhanala, 2019

New Advances in Sparse Learning, Deep Networks, and Adversarial Learning: Theory and Applications Hongyang Zhang, 2019

Gradient Descent for Non-convex Problems in Modern Machine Learning Simon Shaolei Du, 2019

Selective Data Acquisition in Learning and Decision Making Problems Yining Wang, 2019

Anomaly Detection in Graphs and Time Series: Algorithms and Applications Bryan Hooi, 2019

Neural dynamics and interactions in the human ventral visual pathway Yuanning Li, 2018

Tuning Hyperparameters without Grad Students: Scaling up Bandit Optimisation Kirthevasan Kandasamy, 2018

Teaching Machines to Classify from Natural Language Interactions Shashank Srivastava, 2018

Statistical Inference for Geometric Data Jisu Kim, 2018

Representation Learning @ Scale Manzil Zaheer, 2018

Diversity-promoting and Large-scale Machine Learning for Healthcare Pengtao Xie, 2018

Distribution and Histogram (DIsH) Learning Junier Oliva, 2018

Stress Detection for Keystroke Dynamics Shing-Hon Lau, 2018

Sublinear-Time Learning and Inference for High-Dimensional Models Enxu Yan, 2018

Neural population activity in the visual cortex: Statistical methods and application Benjamin Cowley, 2018

Efficient Methods for Prediction and Control in Partially Observable Environments Ahmed Hefny, 2018

Learning with Staleness Wei Dai, 2018

Statistical Approach for Functionally Validating Transcription Factor Bindings Using Population SNP and Gene Expression Data Jing Xiang, 2017

New Paradigms and Optimality Guarantees in Statistical Learning and Estimation Yu-Xiang Wang, 2017

Dynamic Question Ordering: Obtaining Useful Information While Reducing User Burden Kirstin Early, 2017

New Optimization Methods for Modern Machine Learning Sashank J. Reddi, 2017

Active Search with Complex Actions and Rewards Yifei Ma, 2017

Why Machine Learning Works George D. Montañez, 2017

Source-Space Analyses in MEG/EEG and Applications to Explore Spatio-temporal Neural Dynamics in Human Vision Ying Yang, 2017

Computational Tools for Identification and Analysis of Neuronal Population Activity Pengcheng Zhou, 2016

Expressive Collaborative Music Performance via Machine Learning Gus (Guangyu) Xia, 2016

Supervision Beyond Manual Annotations for Learning Visual Representations Carl Doersch, 2016

Exploring Weakly Labeled Data Across the Noise-Bias Spectrum Robert W. H. Fisher, 2016

Optimizing Optimization: Scalable Convex Programming with Proximal Operators Matt Wytock, 2016

Combining Neural Population Recordings: Theory and Application William Bishop, 2015

Discovering Compact and Informative Structures through Data Partitioning Madalina Fiterau-Brostean, 2015

Machine Learning in Space and Time Seth R. Flaxman, 2015

The Time and Location of Natural Reading Processes in the Brain Leila Wehbe, 2015

Shape-Constrained Estimation in High Dimensions Min Xu, 2015

Spectral Probabilistic Modeling and Applications to Natural Language Processing Ankur Parikh, 2015

Computational and Statistical Advances in Testing and Learning Aaditya Kumar Ramdas, 2015

Corpora and Cognition: The Semantic Composition of Adjectives and Nouns in the Human Brain Alona Fyshe, 2015

Learning Statistical Features of Scene Images Wooyoung Lee, 2014

Towards Scalable Analysis of Images and Videos Bin Zhao, 2014

Statistical Text Analysis for Social Science Brendan T. O'Connor, 2014

Modeling Large Social Networks in Context Qirong Ho, 2014

Semi-Cooperative Learning in Smart Grid Agents Prashant P. Reddy, 2013

On Learning from Collective Data Liang Xiong, 2013

Exploiting Non-sequence Data in Dynamic Model Learning Tzu-Kuo Huang, 2013

Mathematical Theories of Interaction with Oracles Liu Yang, 2013

Short-Sighted Probabilistic Planning Felipe W. Trevizan, 2013

Statistical Models and Algorithms for Studying Hand and Finger Kinematics and their Neural Mechanisms Lucia Castellanos, 2013

Approximation Algorithms and New Models for Clustering and Learning Pranjal Awasthi, 2013

Uncovering Structure in High-Dimensions: Networks and Multi-task Learning Problems Mladen Kolar, 2013

Learning with Sparsity: Structures, Optimization and Applications Xi Chen, 2013

GraphLab: A Distributed Abstraction for Large Scale Machine Learning Yucheng Low, 2013

Graph Structured Normal Means Inference James Sharpnack, 2013 (Joint Statistics & ML PhD)

Probabilistic Models for Collecting, Analyzing, and Modeling Expression Data Hai-Son Phuoc Le, 2013

Learning Large-Scale Conditional Random Fields Joseph K. Bradley, 2013

New Statistical Applications for Differential Privacy Rob Hall, 2013 (Joint Statistics & ML PhD)

Parallel and Distributed Systems for Probabilistic Reasoning Joseph Gonzalez, 2012

Spectral Approaches to Learning Predictive Representations Byron Boots, 2012

Attribute Learning using Joint Human and Machine Computation Edith L. M. Law, 2012

Statistical Methods for Studying Genetic Variation in Populations Suyash Shringarpure, 2012

Data Mining Meets HCI: Making Sense of Large Graphs Duen Horng (Polo) Chau, 2012

Learning with Limited Supervision by Input and Output Coding Yi Zhang, 2012

Target Sequence Clustering Benjamin Shih, 2011

Nonparametric Learning in High Dimensions Han Liu, 2010 (Joint Statistics & ML PhD)

Structural Analysis of Large Networks: Observations and Applications Mary McGlohon, 2010

Modeling Purposeful Adaptive Behavior with the Principle of Maximum Causal Entropy Brian D. Ziebart, 2010

Tractable Algorithms for Proximity Search on Large Graphs Purnamrita Sarkar, 2010

Rare Category Analysis Jingrui He, 2010

Coupled Semi-Supervised Learning Andrew Carlson, 2010

Fast Algorithms for Querying and Mining Large Graphs Hanghang Tong, 2009

Efficient Matrix Models for Relational Learning Ajit Paul Singh, 2009

Exploiting Domain and Task Regularities for Robust Named Entity Recognition Andrew O. Arnold, 2009

Theoretical Foundations of Active Learning Steve Hanneke, 2009

Generalized Learning Factors Analysis: Improving Cognitive Models with Machine Learning Hao Cen, 2009

Detecting Patterns of Anomalies Kaustav Das, 2009

Dynamics of Large Networks Jurij Leskovec, 2008

Computational Methods for Analyzing and Modeling Gene Regulation Dynamics Jason Ernst, 2008

Stacked Graphical Learning Zhenzhen Kou, 2007

Actively Learning Specific Function Properties with Applications to Statistical Inference Brent Bryan, 2007

Approximate Inference, Structure Learning and Feature Estimation in Markov Random Fields Pradeep Ravikumar, 2007

Scalable Graphical Models for Social Networks Anna Goldenberg, 2007

Measure Concentration of Strongly Mixing Processes with Applications Leonid Kontorovich, 2007

Tools for Graph Mining Deepayan Chakrabarti, 2005

Automatic Discovery of Latent Variable Models Ricardo Silva, 2005


Funded by Institute for Computational and Data Sciences

UNIVERSITY PARK, Pa. — Graduate students across different disciplines have been nominated for dissertation awards in categories including social impact, undergraduate thesis, graduate thesis, and more.

This is the second annual university-wide dissertation competition that “recognizes excellence among our students,” according to Evan Pugh University Professor and A. Robert Noll Chair Professor of Computer Science and Engineering and Electrical Engineering, Vijaykrishnan Narayanan. 

“Our students are publishing their work in top-tier journals in their disciplines,” Narayanan said. 

Narayanan, also the director for the Center for Artificial Intelligence Foundations and Engineered Systems (CAFE), has been a part of the dissertation competition’s initiation process. 

Dissertations are judged by an anonymous panel in two stages.

Through the AI Hub, three AI dissertations were nominated and named finalists.

AI dissertation finalist Junjie Liang and winner Zhaohui Li reflected on the process.

Li’s dissertation addressed the trade-off STEM (science, technology, engineering, and mathematics) educators face between easily graded selected-response questions, such as multiple choice, and constructed-response questions, which require students to further articulate their own reasoning. Li developed novel natural language processing methods to support the use of constructed-response questions in STEM education for formative assessment, where students receive feedback during instruction.

“I found writing my dissertation both enjoyable and challenging,” Li said. “It’s rewarding because it integrates all my previous research, allowing me to reflect deeply on the essence and contributions of my work, which in turn inspires my future research directions. Also, it has a social impact since the research explored how to extend human-machine collaboration research into real classroom settings.” 

“Receiving this award is incredibly inspiring and means a great deal to me,” Li continued. “It truly motivates my future research, as the recognition from the AI dissertation committee affirms that I am on the right path and making valuable contributions. I am committed to continuing my efforts to improve the world. I believe this will continue to motivate and inspire future PhD researchers.”

Liang had similar experiences to Li. 

Liang’s research highlighted the prevalence of longitudinal data across various disciplines, including electronic health records, the social sciences, and finance, and was dedicated to developing effective machine-learning models tailored for “high dimensional longitudinal data.” Such data comprises “irregular movements collected from a large cohort of subjects over extended periods, often exhibiting intricate multilevel correlations,” according to Liang.

“My research journey has been filled with challenges, successes and excitement,” Liang said. “Unlike widely recognized machine learning fields such as computer vision and Natural Language Processing, longitudinal data analysis is a relatively understated topic, despite its presence over 50 years.” 

“I have achieved a significant breakthrough, developing innovative solutions to the primary challenges in contemporary longitudinal data analysis,” Liang said.  

Li and Liang both expressed gratitude to their advisors and professors, and to their families and friends, for their guidance and support during the process.

“Dr. Rebecca Passonneau (Becky) emphasized that our research should transcend the confines of laboratory algorithms and instead, actively benefit real-world classrooms and society,” Li said.  

Passonneau, Li’s advisor and a professor in the Penn State School of Electrical Engineering and Computer Science, advised doctoral students to continue pursuing their dreams and to show up and step up for their lives, because “you’ll find that you tap into resources you didn’t know you had.”

“These dissertations [and research] have foundational value to the discipline of AI and for land-grant institutes like Penn State,” Narayanan said. “We are seeing some relevant impact on society in addition to foundational contributions. The work of our students has so much influence on our society. We want to remind people that we showcase and spotlight the finalists, but there are so many wonderful applicants throughout the process.” 

Narayanan urges faculty and colleagues to keep thinking about nominations for next year’s dissertation awards.

Directors of the various Penn State AI organizations were key partners in this award process, including AI Director David Hunter, the Center for Applications of AI/ML to Industry (AIMI), the Center for Socially Responsible AI (CSRAI), the Center for Artificial Intelligence Foundations and Scientific Applications (CENSAI), and the Nittany AI Alliance.

Penn State is one of only three universities to hold the distinction of being a LAND, SPACE, SUN and SEA grant institution.


Artificial Intelligence

Completed Theses

State-space search solves navigation tasks and many other real-world problems. Heuristic search, especially greedy best-first search, is one of the most successful algorithms for state-space search. We improve the state of the art in heuristic search in three directions.

In Part I, we present methods to train neural networks as powerful heuristics for a given state space. We present a universal approach to generating training data using random walks from a (partial) state. We demonstrate that our heuristics trained for a specific task are often better than heuristics trained for a whole domain. We show that the performance of all trained heuristics is highly complementary: there is no clear pattern indicating which trained heuristic to prefer for a specific task. In general, model-based planners still outperform planners with trained heuristics, but our approaches exceed the model-based algorithms in the Storage domain. To our knowledge, a learning-based planner has exceeded the state-of-the-art model-based planners only once before, in the Spanner domain.
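
A minimal sketch of the random-walk idea (generic, not the thesis's exact implementation): walk away from a given state and label each visited state with the number of steps taken, which serves as a noisy cost estimate for training; successors() is an assumed interface.

    import random

    def random_walk_samples(start, successors, num_walks=100, max_len=50):
        # Generate (state, estimated_cost) training pairs by random walks.
        # `successors(s)` is an assumed interface returning neighbor states of s;
        # for goal-distance labels, the walk would start at a goal state and use
        # backward successors.
        samples = []
        for _ in range(num_walks):
            current = start
            for steps in range(1, max_len + 1):
                neighbors = successors(current)
                if not neighbors:
                    break
                current = random.choice(neighbors)
                samples.append((current, steps))  # walk length as a noisy label
        return samples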

A priori, it is unknown whether a heuristic, or in the more general case a planner, will perform well on a task. Hence, we train online portfolios to select the best planner for a task. Today, all online portfolios are based on handcrafted features. In Part II, we present new online portfolios based on neural networks which receive the complete task as input, not just a few handcrafted features. Additionally, our portfolios can reconsider their choices. Both extensions greatly improve the state of the art of online portfolios. Finally, we show that explainable machine learning techniques, as an alternative to neural networks, also make good online portfolios. Additionally, we present methods to improve our trust in their predictions.

Even if we select the best search algorithm, we cannot solve some tasks in reasonable time. We can speed up the search if we know how it will behave in the future. In Part III, we inspect the behavior of greedy best-first search with a fixed heuristic on simple tasks of a domain to learn its behavior on any task of the same domain. Once greedy best-first search has expanded a progress state, it expands only states with lower heuristic values. We learn to identify progress states and present two methods to exploit this knowledge. Building upon this, we extract the bench transition system of a task and generalize it in such a way that we can apply it to any task of the same domain. We can use this generalized bench transition system to split a task into a sequence of simpler searches.

In all three research directions, we contribute new approaches and insights to the state of the art, and we indicate interesting topics for future work.

Greedy best-first search (GBFS) is a sibling of A* in the family of best-first state-space search algorithms. While A* is guaranteed to find optimal solutions of search problems, GBFS does not provide any guarantees, but it typically finds satisficing solutions more quickly than A*. A classical result on optimal best-first search shows that A* with an admissible and consistent heuristic expands every state whose f-value is below the optimal solution cost and no state whose f-value is above the optimal solution cost. Theoretical results of this kind are useful for the analysis of heuristics in different search domains and for the improvement of algorithms. For satisficing algorithms, a similarly clear understanding is currently lacking. We examine the search behavior of GBFS in order to make progress towards such an understanding.
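
Stated formally (the standard formulation, with g the cost to reach a state, h the heuristic, f = g + h, and C* the optimal solution cost):

    \[
    f(s) < C^{*} \;\Rightarrow\; \text{A* expands } s,
    \qquad
    f(s) > C^{*} \;\Rightarrow\; \text{A* does not expand } s.
    \]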

We introduce the concept of high-water mark benches, which separate the search space into areas that are searched by GBFS in sequence. High-water mark benches allow us to exactly determine the set of states that GBFS expands under at least one tie-breaking strategy. We show that benches contain craters. Once GBFS enters a crater, it has to expand every state in the crater before being able to escape.

Benches and craters allow us to characterize the best-case and worst-case behavior of GBFS in given search instances. We show that computing the best-case or worst-case behavior of GBFS is NP-complete in general but can be computed in polynomial time for undirected state spaces.

We present algorithms for extracting the set of states that GBFS potentially expands and for computing the best-case and worst-case behavior. We use the algorithms to analyze GBFS on benchmark tasks from planning competitions under a state-of-the-art heuristic. Experimental results reveal interesting characteristics of the heuristic on the given tasks and demonstrate the importance of tie-breaking in GBFS.
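
For reference, a minimal greedy best-first search in Python (a generic sketch, not the analysis code of the thesis): states are expanded in order of their heuristic value alone, and the counter implements one concrete (FIFO) tie-breaking strategy, the very choice the bench and crater analysis shows can matter.

    import heapq
    import itertools

    def gbfs(initial, is_goal, successors, h):
        # Greedy best-first search: expand states in order of h alone.
        counter = itertools.count()  # FIFO tie-breaking among equal h-values
        frontier = [(h(initial), next(counter), initial, [initial])]
        seen = {initial}
        while frontier:
            _, _, state, path = heapq.heappop(frontier)
            if is_goal(state):
                return path
            for succ in successors(state):
                if succ not in seen:
                    seen.add(succ)
                    heapq.heappush(frontier,
                                   (h(succ), next(counter), succ, path + [succ]))
        return None  # no solution found from `initial`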

Classical planning tackles the problem of finding a sequence of actions that leads from an initial state to a goal. Over the last decades, planning systems have become significantly better at answering the question of whether such a sequence exists by applying a variety of techniques that have become more and more complex. As a result, it has become nearly impossible to formally analyze whether a planning system is actually correct in its answers, and we need to rely on experimental evidence.

One way to increase trust is the concept of certifying algorithms, which provide a witness that justifies their answer and can be verified independently. When a planning system finds a solution to a problem, the solution itself is a witness, and we can verify it by simply applying it. But what if the planning system claims the task is unsolvable? So far, there has been no principled way of verifying this claim.

This thesis contributes two approaches to create witnesses for unsolvable planning tasks. Inductive certificates are based on the idea of invariants. They argue that the initial state is part of a set of states that we cannot leave and that contains no goal state. In our second approach, we define a proof system that proves in an incremental fashion that certain states cannot be part of a solution until it has proven that either the initial state or all goal states are such states.
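
The first approach can be stated compactly. Writing s_I for the initial state, S_G for the goal states, and succ(S) for the set of successors of states in S, a set of states S is an inductive certificate of unsolvability if

    \[
    s_I \in S, \qquad \mathrm{succ}(S) \subseteq S, \qquad S \cap S_G = \emptyset,
    \]

since then every state reachable from s_I stays inside S and no goal state is ever reached.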

Both approaches are complete in the sense that a witness exists for every unsolvable planning task, and they can be verified efficiently (with respect to the size of the witness) by an independent verifier if certain criteria are met. To show their applicability to state-of-the-art planning techniques, we provide an extensive overview of how these approaches can cover several search algorithms, heuristics and other techniques. Finally, we show with an experimental study that generating and verifying these certificates is not only theoretically possible but also practically feasible, thus making a first step towards fully certifying planning systems.

Heuristic search with an admissible heuristic is one of the most prominent approaches to solving classical planning tasks optimally. In the first part of this thesis, we introduce a new family of admissible heuristics for classical planning, based on Cartesian abstractions, which we derive by counterexample-guided abstraction refinement. Since one abstraction usually is not informative enough for challenging planning tasks, we present several ways of creating diverse abstractions. To combine them admissibly, we introduce a new cost partitioning algorithm, which we call saturated cost partitioning. It considers the heuristics sequentially and uses the minimum amount of costs that preserves all heuristic estimates for the current heuristic before passing the remaining costs to subsequent heuristics until all heuristics have been served this way.
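
The following sketch captures the sequential structure of saturated cost partitioning. The interfaces are assumptions for illustration: h.saturate(costs) stands for computing the minimum cost function that preserves all of h's estimates under the given costs, and h.value(state, costs) for evaluating h under a cost function.

    def saturated_cost_partitioning(heuristics, costs):
        # Process heuristics in order; give each the minimum costs it needs and
        # pass the remaining costs on to the next heuristic.
        remaining = dict(costs)
        components = []
        for h in heuristics:
            saturated = h.saturate(remaining)   # hypothetical interface
            components.append((h, saturated))
            for op in remaining:
                remaining[op] -= saturated[op]  # unconsumed costs stay available

        def h_scp(state):
            # Admissible sum: each component is evaluated under its own costs.
            return sum(h.value(state, sat) for h, sat in components)
        return h_scp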

In the second part, we show that saturated cost partitioning is strongly influenced by the order in which it considers the heuristics. To find good orders, we present a greedy algorithm for creating an initial order and a hill-climbing search for optimizing a given order. Both algorithms make the resulting heuristics significantly more accurate. However, we obtain the strongest heuristics by maximizing over saturated cost partitioning heuristics computed for multiple orders, especially if we actively search for diverse orders.

The third part provides a theoretical and experimental comparison of saturated cost partitioning and other cost partitioning algorithms. Theoretically, we show that saturated cost partitioning dominates greedy zero-one cost partitioning. The difference between the two algorithms is that saturated cost partitioning opportunistically reuses unconsumed costs for subsequent heuristics. By applying this idea to uniform cost partitioning we obtain an opportunistic variant that dominates the original. We also prove that the maximum over suitable greedy zero-one cost partitioning heuristics dominates the canonical heuristic and show several non-dominance results for cost partitioning algorithms. The experimental analysis shows that saturated cost partitioning is the cost partitioning algorithm of choice in all evaluated settings and it even outperforms the previous state of the art in optimal classical planning.

Classical planning is the problem of finding a sequence of deterministic actions in a state space that leads from an initial state to a state satisfying some goal condition. The dominant approach to solving planning tasks optimally is heuristic search, in particular A* search combined with an admissible heuristic. While there exist many different admissible heuristics, we focus on abstraction heuristics in this thesis, and in particular on the well-established merge-and-shrink heuristics.

Our main theoretical contribution is to provide a comprehensive description of the merge-and-shrink framework in terms of transformations of transition systems. Unlike previous accounts, our description is fully compositional, i.e. can be understood by understanding each transformation in isolation. In particular, in addition to the name-giving merge and shrink transformations, we also describe pruning and label reduction as such transformations. The latter is based on generalized label reduction, a new theory that removes all of the restrictions of the previous definition of label reduction. We study the four types of transformations in terms of desirable formal properties and explain how these properties transfer to heuristics being admissible and consistent or even perfect. We also describe an optimized implementation of the merge-and-shrink framework that substantially improves the efficiency compared to previous implementations.

Furthermore, we investigate the expressive power of merge-and-shrink abstractions by analyzing factored mappings, the data structure they use for representing functions. In particular, we show that there exist certain families of functions that can be compactly represented by so-called non-linear factored mappings but not by linear ones.

On the practical side, we contribute several non-linear merge strategies to the merge-and-shrink toolbox. In particular, we adapt a merge strategy from model checking to planning, provide a framework to enhance existing merge strategies based on symmetries, devise a simple score-based merge strategy that minimizes the maximum size of transition systems of the merge-and-shrink computation, and describe another framework to enhance merge strategies based on an analysis of causal dependencies of the planning task.

In a large experimental study, we show the evolution of the performance of merge-and-shrink heuristics on planning benchmarks. Starting with the state of the art before the contributions of this thesis, we subsequently evaluate all of our techniques and show that state-of-the-art non-linear merge-and-shrink heuristics improve significantly over the previous state of the art.

Admissible heuristics are the main ingredient when solving classical planning tasks optimally with heuristic search. Higher admissible heuristic values are more accurate, so combining them in a way that dominates their maximum and remains admissible is an important problem.

The thesis makes three contributions in this area. Extensions to cost partitioning (a well-known heuristic combination framework) make it possible to produce higher estimates from the same set of heuristics. The new heuristic family called operator-counting heuristics unifies many existing heuristics and offers a new way to combine them. Another new family of heuristics, called potential heuristics, allows casting the problem of finding a good heuristic as an optimization problem.

Both operator-counting and potential heuristics are closely related to cost partitioning. They offer a new look on cost partitioned heuristics and already sparked research beyond their use as classical planning heuristics.
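
In their simplest (atomic) form, potential heuristics sum a numeric weight for every fact that is true in a state, and finding good weights becomes a linear program (a standard formulation, paraphrased here):

    \[
    h^{\mathrm{pot}}(s) \;=\; \sum_{f \in F} w_f \, [\, f \in s \,],
    \]

where the weights w_f are subject to linear constraints enforcing goal-awareness and consistency for every operator, and the objective (for example, maximizing the heuristic value of the initial state) selects one admissible heuristic from the feasible set.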

Master's theses

Optimal planning is an ongoing topic of research and requires efficient heuristic search algorithms. One way of calculating such heuristics is through the use of linear programs (LPs) and solvers thereof. This thesis investigates the efficiency of LP-based heuristic search for different heuristics, focusing on how different LP solving strategies and solver settings impact the performance of calculating these heuristics. Using the Fast Downward planning system and a comprehensive benchmark set of planning tasks, we conducted a series of experiments to determine the effectiveness of the primal and dual simplex methods and the primal-dual logarithmic barrier method. Our results show that the choice of LP solver and the application of specific solver settings influence the efficiency of calculating the required heuristics, and that the default setting of CPLEX is not optimal in some cases and can be improved by specifying an LP solver or using other non-default solver settings. This thesis lays the groundwork for future research on using different LP solving algorithms and solver settings in the context of LP-based heuristic search in optimal planning.
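
The thesis works with CPLEX inside Fast Downward; as a self-contained illustration of the same knob, the sketch below solves a tiny made-up LP with SciPy's HiGHS backends, where "highs-ds" selects a (dual) simplex method and "highs-ipm" an interior-point (barrier) method:

    import time
    from scipy.optimize import linprog

    # A tiny made-up LP: minimize c @ x subject to A_ub @ x <= b_ub, x >= 0.
    c = [-1.0, -2.0]
    A_ub = [[1.0, 1.0], [1.0, 3.0]]
    b_ub = [4.0, 6.0]

    for method in ("highs-ds", "highs-ipm"):
        start = time.perf_counter()
        result = linprog(c, A_ub=A_ub, b_ub=b_ub, method=method)
        elapsed = time.perf_counter() - start
        print(method, result.fun, f"{elapsed:.4f}s")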

Classical planning tasks are typically formulated in PDDL. Some of them can be described more concisely using derived variables. Contrary to basic variables, their values cannot be changed by operators; instead, they are determined by axioms, which specify the conditions under which they take a certain value. Planning systems often support axioms in their search component, but support in their heuristics is limited or nonexistent. This leads to decreased search performance on tasks that use axioms. We compile axioms away using our implementation of a known algorithm in the Fast Downward planner. Our results show that the compilation has a negative impact on search performance, with its only benefit being the ability to use heuristics that have no axiom support. As a compromise between performance and expressivity, we identify axioms of a simple form and devise a compilation for them. We compile away all axioms in several of the tested domains without a decline in search performance.

The International Planning Competitions (IPCs) serve as a testing suite for planning systems. Their domains are well-motivated, as they are derived from, or possess characteristics analogous to, real-life applications. In this thesis, we study the computational complexity of the plan existence and bounded plan existence decision problems of the following grid-based IPC domains: VisitAll, TERMES, Tidybot, Floortile, and Nurikabe. In all of these domains, there are one or more agents moving through a rectangular grid (potentially with obstacles), performing actions along the way. In many cases, we engineer instances that can be solved only if the movement of the agent or agents follows a Hamiltonian path or cycle in a grid graph. This gives rise to many NP-hardness reductions from Hamiltonian path/cycle problems on grid graphs. In the case of VisitAll and Floortile, we give necessary and sufficient conditions for deciding the plan existence problem in polynomial time. We also show that Tidybot has the game Push-1F as a special case, and its plan existence problem is thus PSPACE-complete. The hardness proofs in this thesis highlight hard instances of these domains. Moreover, by assigning a complexity class to each domain, researchers and practitioners can better assess the strengths and limitations of new and existing algorithms in these domains.

Planning tasks can be used to describe many real-world problems of interest. Solving those tasks optimally is thus an avenue of great interest. One established and successful approach to optimal planning is the merge-and-shrink framework, which decomposes the task into a factored transition system. The factors initially represent the behaviour of one state variable each and are repeatedly combined and abstracted. The solution costs of the abstract states are then used as a heuristic to guide search in the original planning task. Existing merge-and-shrink transformations keep the factored transition system orthogonal, meaning that each variable of the planning task is represented in no more than one factor at any point. In this thesis, we introduce the clone transformation, which duplicates a factor of the factored transition system, making it non-orthogonal. We test two classes of clone strategies, which we introduce and implement in the Fast Downward planning system, and conclude that, while theoretically promising, our clone strategies are practically inefficient, as their performance was worse than state-of-the-art methods for merge-and-shrink.

This thesis presents a novel approach for improving the performance of classical planning algorithms by integrating cost partitioning with merge-and-shrink techniques. Cost partitioning is a well-known technique for admissibly adding multiple heuristic values. Merge-and-shrink, on the other hand, is a technique for generating well-informed abstractions. The “merge” part of the technique creates an abstract representation of the original problem by replacing two transition systems with their synchronised product, while the “shrink” part reduces the size of a factor. By combining these two approaches, we aim to leverage the strengths of both methods to achieve better scalability and efficiency in solving classical planning problems. Considering a range of benchmark domains and the Fast Downward planning system, the experimental results show that the proposed method achieves the goal of fusing merge-and-shrink with cost partitioning towards better outcomes in classical planning.
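
The synchronised product underlying the merge step has a compact standard definition: for transition systems T_1 and T_2 over a shared label set L,

    \[
    T_1 \otimes T_2 = \Bigl( S_1 \times S_2,\; L,\;
    \bigl\{ (s_1, s_2) \xrightarrow{\ell} (s_1', s_2') \;\big|\;
    s_1 \xrightarrow{\ell} s_1' \text{ in } T_1 \text{ and }
    s_2 \xrightarrow{\ell} s_2' \text{ in } T_2 \bigr\} \Bigr),
    \]

so the product moves exactly when both factors can move under the same label.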

Planning is the process of finding a path in a planning task from the initial state to a goal state. Multiple algorithms have been implemented to solve such planning tasks, one of them being the Property-Directed Reachability (PDR) algorithm. Property-Directed Reachability utilizes a series of propositional formulas called layers, where layer k represents a superset of the states with a goal distance of at most k. The algorithm iteratively improves the layers so that they represent as few states as possible, by strengthening the layer formulas and thereby excluding states whose goal distance is higher than the layer index. The goal of this thesis is to implement a preprocessing step that seeds the layers with formulas that already exclude as many states as possible, to potentially improve runtime performance. We use the pattern database heuristic and its associated pattern generators to exploit the structure of the planning task in the seeding algorithm. We found that seeding does not consistently improve the performance of the Property-Directed Reachability algorithm: although we observed a significant reduction in planning time for some tasks, it increased significantly for others.
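
A rough sketch of the seeding idea under stated assumptions: because a pattern database (PDB) underestimates true goal distances, any state whose abstract goal distance already exceeds k can be excluded from layer k. Here `pdb` is a hypothetical mapping from abstract states (tuples of literals) to abstract goal distances, and negate() a hypothetical literal-flipping helper.

    def seed_layers(pdb, horizon, negate):
        # Layer k over-approximates the states with goal distance <= k, so it
        # may safely exclude every state whose abstract goal distance exceeds k.
        layers = [[] for _ in range(horizon + 1)]
        for k in range(horizon + 1):
            for abstract_state, dist in pdb.items():
                if dist > k:
                    # Clause forbidding all concrete states that map to this
                    # abstract state.
                    layers[k].append([negate(lit) for lit in abstract_state])
        return layers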

Certifying algorithms is a concept developed to increase trust by demanding affirmation of the computed result in the form of a certificate. By inspecting the certificate, it is possible to determine the correctness of the produced output. Modern planning systems have long been certifying in the case of solvable instances, where a generated plan acts as a certificate.

Only recently have there been first steps towards certifying unsolvability judgments, in the form of inductive certificates, which represent certain sets of states. Inductive certificates are expressed with the help of propositional formulas in a specific formalism.

In this thesis, we investigate the use of propositional formulas in conjunctive normal form (CNF) as a formalism for inductive certificates. First, we look into an approach that allows us to construct formulas representing inductive certificates in CNF. To show the general applicability of this approach, we extend it to the family of delete relaxation heuristics. Furthermore, we present how a planning system can generate an inductive validation formula, a single formula that can be used to validate whether the set found by the planner is indeed an inductive certificate. Finally, we show with an experimental evaluation that the CNF formalism can be feasible in practice for the generation and validation of inductive validation formulas.
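
The validation step then reduces to a single satisfiability check, which any off-the-shelf SAT solver can perform. A minimal sketch with PySAT, assuming the planner has already emitted the inductive validation formula as DIMACS-style integer clauses and assuming the convention that the certificate is accepted iff the formula is unsatisfiable (the tiny CNF below is made up):

    from pysat.solvers import Glucose3

    def is_unsatisfiable(clauses):
        # Accept the certificate iff the validation formula has no model.
        with Glucose3(bootstrap_with=clauses) as solver:
            return not solver.solve()

    # Made-up stand-in for a planner-emitted inductive validation formula.
    validation_cnf = [[1, 2], [-1, 2], [1, -2], [-1, -2]]
    print(is_unsatisfiable(validation_cnf))  # True: this tiny CNF is UNSAT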

In generalized planning, the aim is to solve whole classes of planning tasks instead of single tasks one at a time. Generalized representations provide information or knowledge about such classes to help solve them. This work compares the expressiveness of three generalized representations (generalized potential heuristics, policy sketches, and action schema networks) in terms of compilability. We use a notion of equivalence that requires two generalized representations to decompose the tasks of a class into the same subtasks. We present compilations between pairs of equivalent generalized representations and proofs for cases where a compilation is impossible.

A Digital Microfluidic Biochip (DMFB) is a digitally controllable lab-on-a-chip. Droplets of fluids are moved, merged and mixed on a grid. Routing these droplets efficiently has been tackled by various approaches. We try to use temporal planning for droplet routing, inspired by its use in quantum circuit compilation. We test a model for droplet routing in both classical and temporal planning and compare the two versions. We show that our classical planning model is an efficient method for finding droplet routes on DMFBs. We then extend our model to include spawning, disposing, merging, splitting and mixing of droplets. The results of these extensions show that we are able to find plans for simple experiments. When scaling the problem size to real-life experiments, however, our model fails to find plans.

Cost partitioning is a technique used to calculate heuristics in classical optimal planning. It involves solving a linear program, which can be decomposed into a master problem and pricing problems. In this thesis, we combine Fourier-Motzkin elimination and the double description method in different ways to precompute the generating rays of the pricing problems. We empirically evaluate these approaches and propose a new method that replaces the Fourier-Motzkin elimination. Our new method improves the performance of our approaches with respect to runtime and peak memory usage.
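
For reference, one Fourier-Motzkin elimination step has the standard textbook form: to eliminate a variable x, rewrite every inequality involving x as a lower bound or an upper bound on x and combine all pairs:

    \[
    \ell_j(\mathbf{y}) \le x \le u_i(\mathbf{y})
    \;\;\leadsto\;\;
    \ell_j(\mathbf{y}) \le u_i(\mathbf{y}) \quad \text{for all pairs } (i, j),
    \]

keeping all inequalities that do not mention x. The projection is exact, but the number of inequalities can grow quadratically with each eliminated variable, which is one motivation for combining it with the double description method.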

The increasing amount of data available nowadays has contributed to new scheduling approaches. Aviation is one of the domains most concerned, as aircraft engines entail millions of maintenance events handled by staff worldwide. In this thesis, we present a constraint-programming-based algorithm for the aircraft maintenance scheduling problem. We want to find the best time to perform the maintenance by determining which employee will do the work and when. We report how the scheduling process in aviation can be automated.

Stochastic state-space tasks are commonly solved with methods from artificial intelligence, and PROST2014 is the state of the art for determining good actions in MDP environments. In this thesis, we aim to outperform the dominant planning system PROST2014 with a heuristic based on neural networks. For this purpose, we introduce two variants of neural networks that estimate the Q-value of a given state-action pair. Since we chose supervised learning as the training method, generating training data was one of the main tasks, in addition to designing the architecture and components of the neural networks. To determine the most suitable network parameters, we performed a sequential parameter search, from which we expected a local optimum of the model settings. In the end, the PROST2014 planning system could not be surpassed in the total rating evaluation. Nevertheless, in individual domains, the neural networks achieved higher final scores. The results show the potential of this approach and point to possible adaptations in future work.

In classical planning, there are tasks that are hard and tasks that are easy. We can measure the complexity of a task with the correlation complexity, the improvability width, and the novelty width. In this work, we compare these measures.

We investigate what causes a correlation complexity of at least 2. To do so, we translate the state space into a vector space, which allows us to make use of linear algebra and convex cones.

Additionally, we introduce the Basel measure, a new measure that is based on potential heuristics and therefore similar to the correlation complexity, but also comparable to the novelty width. We show that the Basel measure is a lower bound for the correlation complexity and that the novelty width plus one is an upper bound for the Basel measure.

Furthermore, we compute the Basel measure for some tasks of the International Planning Competitions and show that the translation of a task can increase the Basel measure by removing seemingly irrelevant state variables.

Unsolvability is an important result in classical planning and has seen increased interest in recent years. This thesis explores unsolvability detection by automatically generating parity arguments, a well-known way of proving unsolvability. The argument requires an invariant measure whose parity remains constant across all reachable states, while all goal states are of the opposite parity. We express parity arguments using potential functions over the field F_2. We develop a set of constraints that describes potential functions with the necessary separating property, and show that the constraints can be represented efficiently for up to two-dimensional features. Enhanced with mutex information, this yields an algorithm that tests whether a parity function exists for a given planning task. The existence of such a function proves the task unsolvable. To determine its practical use, we empirically evaluate our approach on a benchmark of unsolvable problems and compare its performance to a state-of-the-art unsolvability planner. We lastly analyze the arguments found by our algorithm to confirm their validity and understand their expressive power.
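
The core computational step, finding a potential function over F_2 that satisfies a set of linear constraints, amounts to solving a linear system modulo 2. A minimal Gauss-Jordan sketch, independent of the thesis's actual constraint encoding:

```python
def solve_gf2(matrix, rhs):
    """Solve A·x = b over F_2 by Gauss-Jordan elimination.

    matrix: list of 0/1 rows, rhs: list of 0/1 values. Returns one
    solution, or None if the system is inconsistent, i.e. no parity
    function satisfies the constraints.
    """
    rows = [row[:] + [r] for row, r in zip(matrix, rhs)]
    num_vars = len(matrix[0])
    pivot_cols, rank = [], 0
    for col in range(num_vars):
        pivot = next((i for i in range(rank, len(rows)) if rows[i][col]), None)
        if pivot is None:
            continue  # free variable, set to 0 below
        rows[rank], rows[pivot] = rows[pivot], rows[rank]
        for i in range(len(rows)):
            if i != rank and rows[i][col]:
                rows[i] = [x ^ y for x, y in zip(rows[i], rows[rank])]
        pivot_cols.append(col)
        rank += 1
    if any(row[-1] for row in rows[rank:]):
        return None  # derived 0 = 1: inconsistent
    solution = [0] * num_vars
    for r, col in enumerate(pivot_cols):
        solution[col] = rows[r][-1]
    return solution
```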

We implemented the invariant synthesis algorithm proposed by Rintanen and experimentally compared it against Helmert’s mutex group synthesis algorithm as implemented in Fast Downward.

The context for the comparison is the translation of propositional STRIPS tasks to FDR tasks, which requires the identification of mutex groups.

Because of its dominating lead in translation speed, combined with the few and marginal advantages the alternative offers during search, Helmert's algorithm is clearly better for most uses. Meanwhile, Rintanen's algorithm is capable of finding invariants other than mutexes, which Helmert's algorithm by design cannot do.

The International Planning Competition (IPC) is a competition of state-of-the-art planning systems, which are evaluated on a diverse set of problems. It focuses on the challenges of AI planning by analyzing classical, probabilistic, and temporal planning and by presenting new problems for future research. Probabilistic domains introduced in IPC 2018 include Academic Advising, Chromatic Dice, Cooperative Recon, Manufacturer, Push Your Luck, and Red-finned Blue-eyes.

This thesis aims to solve two probabilistic IPC 2018 domains, Academic Advising and Chromatic Dice, (near-)optimally, using different techniques for each. In Academic Advising, we use a relevance analysis to remove irrelevant actions and state variables from the planning task. We then convert the problem from probabilistic to classical planning, which lets us solve it efficiently. In Chromatic Dice, we implement backtracking search to solve the smaller instances optimally. More complex instances are partitioned into several smaller planning tasks, and a near-optimal policy is derived as a combination of the optimal solutions to the small instances.

The motivation for finding (near-)optimal policies is related to the IPC score, which measures the quality of the planners. By providing optimal upper bounds for these domains, we contribute to stabilizing the IPC score evaluation metric for them.

Most well-known and traditional online planners for probabilistic planning are in some way based on Monte-Carlo Tree Search. SOGBOFA, symbolic online gradient-based optimization for factored action MDPs, offers a new perspective: it constructs a function graph encoding the expected reward for a given input state, using independence assumptions for states and actions. On this function, it performs gradient ascent to symbolically optimize the actions for the current state. This unique approach to probabilistic planning has shown very strong results and even more potential. In this thesis, we integrate the ideas SOGBOFA presents into the traditionally successful Trial-based Heuristic Tree Search framework. Specifically, we design and evaluate two heuristics based on the aforementioned graph and its Q-value estimations, as well as the search using gradient ascent. We implement and evaluate these heuristics in the Prost planner, along with a version of the current standalone planner.

In this thesis, we consider cyclical dependencies between landmarks for cost-optimal planning. Landmarks denote properties that must hold at least once in all plans. However, if the orderings between them induce cyclical dependencies, one of the landmarks in each cycle must be achieved an additional time. We propose the generalized cycle-covering heuristic, which considers this in addition to the cost of achieving all landmarks once.

Our research is motivated by recent applications of cycle-covering in the Freecell and logistics domains, where it yields near-optimal results. We carry it over to domain-independent planning using a linear programming approach: the LP relaxation of a minimum hitting set problem over the landmarks is enhanced by constraints for the cyclical dependencies between them. In theory, this approach dominates a heuristic that only considers landmarks.
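
A hedged sketch of such an LP, using the PuLP library: every landmark must be achieved at least once, and every cycle forces one extra achievement among its members. Variable and parameter names are illustrative, not the thesis's model:

```python
import pulp

def cycle_covering_value(landmarks, costs, cycles):
    """LP lower bound: each landmark achieved at least once, and each
    cyclic dependency forces one extra achievement among its members.

    landmarks: landmark ids, costs: id -> cheapest achiever cost,
    cycles: list of lists of landmark ids.
    """
    model = pulp.LpProblem("cycle_covering", pulp.LpMinimize)
    count = {l: pulp.LpVariable(f"y_{l}", lowBound=1) for l in landmarks}
    model += pulp.lpSum(costs[l] * count[l] for l in landmarks)
    for cycle in cycles:
        # Some landmark of the cycle must be achieved a second time.
        model += pulp.lpSum(count[l] for l in cycle) >= len(cycle) + 1
    model.solve(pulp.PULP_CBC_CMD(msg=False))
    return pulp.value(model.objective)
```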

We apply the cycle-covering heuristic in practice, where its theoretical dominance is confirmed: many planning tasks contain cyclical dependencies, and considering them improves the heuristic estimates. However, the number of tasks solved using the improved heuristic is virtually unaffected. We still believe that this feature of landmarks offers great potential for future work.

Potential heuristics are a class of heuristics used in classical planning to guide a search algorithm towards a goal state. Most of the existing research on potential heuristics focuses on finding heuristics that are admissible, such that they can be used by an algorithm such as A* to arrive at an optimal solution. In this thesis, we focus on the computation of potential heuristics for satisficing planning, where plan optimality is not required and the objective is to find any solution. Specifically, our focus is on the computation of potential heuristics that are descending and dead-end avoiding (DDA), since these properties guarantee favorable search behavior when used with greedy search algorithms such as hill climbing. We formally prove that the computation of DDA heuristics is a PSPACE-complete problem and propose several approximation algorithms. Our evaluation shows that the resulting heuristics are competitive with established approaches such as pattern databases in terms of heuristic quality but suffer from several performance bottlenecks.

Most automated planners use heuristic search to solve the tasks. Usually, the planners get as input a lifted representation of the task in PDDL, a compact formalism describing the task using a fragment of first-order logic. The planners then transform this task description into a grounded representation where the task is described in propositional logic. This grounded format can be exponentially larger than the lifted one, but many planners use it because it is easier to implement and reason about.

However, sometimes this transformation from the lifted to the grounded representation is not tractable. In such cases, there is not much that planners based on heuristic search can do: since the transformation is a required preprocessing step, when it fails, the whole planner fails.

To solve the grounding problem, we introduce new methods to deal with tasks that cannot be grounded. Our work aims to find good ways to perform heuristic search directly on a lifted representation of planning problems. We view planning as a database progression problem and borrow solutions from the areas of relational algebra and database theory.
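
As an illustration of the database view, the applicability of a lifted action schema can be computed as a natural join of the relations holding in the current state. The toy relations below are hypothetical; real lifted planners use indexed, ordered join programs:

```python
from itertools import product

def join(rel_a, rel_b):
    """Natural join of two relations given as (columns, set of tuples).

    This mirrors how a lifted planner enumerates all instantiations of
    an action schema's preconditions without grounding the whole task.
    """
    cols_a, rows_a = rel_a
    cols_b, rows_b = rel_b
    shared = [c for c in cols_b if c in cols_a]
    extra = [c for c in cols_b if c not in cols_a]
    out = set()
    for ta, tb in product(rows_a, rows_b):
        if all(ta[cols_a.index(c)] == tb[cols_b.index(c)] for c in shared):
            out.add(ta + tuple(tb[cols_b.index(c)] for c in extra))
    return cols_a + extra, out

# Hypothetical Blocksworld-style preconditions on(?x, ?y) and clear(?x):
on = (["x", "y"], {("a", "b"), ("b", "c")})
clear = (["x"], {("a",)})
print(join(on, clear))  # (['x', 'y'], {('a', 'b')})
```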

Our theoretical and empirical results are encouraging: several instances that were never solved by any planner in the literature are now solved by our new lifted planner. For example, our planner can solve the challenging Organic Synthesis domain using breadth-first search, while state-of-the-art planners cannot solve more than 60% of its instances. Furthermore, our results offer a new perspective on and a deep theoretical study of lifted representations for planning tasks.

The generation of independently verifiable proofs for the unsolvability of planning tasks is possible for different heuristics, including linear Merge-and-Shrink heuristics, by using a proof system framework. Proof generation for non-linear Merge-and-Shrink heuristics, however, is currently not supported. This is due to the lack of a suitable state set representation formalism that can compactly represent the states mapped to a certain value in the corresponding Merge-and-Shrink representation (MSR). In this thesis, we overcome this shortcoming by using Sentential Decision Diagrams (SDDs) as set representations. We describe an algorithm that constructs the desired SDD from the MSR and show that efficient proof verification is possible with SDDs as the representation formalism. Additionally, we use a proof-of-concept implementation to analyze the overhead incurred by the proof generation functionality and the runtime of proof verification.

The operator-counting framework is a framework in classical planning for heuristics based on linear programming. It covers several kinds of state-of-the-art linear programming heuristics, among them the post-hoc optimization heuristic. In this thesis, we use post-hoc optimization constraints and evaluate them under altered cost functions instead of the original cost function of the planning task. We show that such cost-altered post-hoc optimization constraints are also covered by the operator-counting framework and that they can yield improved heuristic estimates compared to post-hoc optimization constraints under the original cost function. In our experiments, we were not able to achieve improved problem coverage, as we did not find a method for generating favorable cost functions that works well across domains.

Heuristic forward search is the state-of-the-art approach to solving classical planning problems. Bidirectional heuristic search, on the other hand, has a lot of potential but has never delivered on these expectations in practice. Only recently, the near-optimal bidirectional search algorithm (NBS) was introduced by Chen et al.; as the name suggests, NBS expands nearly the minimal number of states required to solve any search problem. This novel achievement makes NBS a very promising and efficient search algorithm. With this premise in mind, we raise the question of how applicable NBS is to planning. In this thesis, we investigate this question by implementing NBS in the state-of-the-art planner Fast Downward and analysing its performance on the benchmarks of the latest International Planning Competition. We additionally implement fractional meet-in-the-middle and computeWVC to analyse NBS's performance more thoroughly with regard to the structure of the problem task.

The conducted experiments show that NBS can be applied successfully to planning, as it consistently outperformed A*. Especially good results were achieved in the domains blocks, driverlog, floortile-opt11-strips, ged-opt14-strips, logistics00, and termes-opt18-strips. Analysing these results, we deduce that the efficiency of forward and backward search depends heavily on the underlying implicit structure of the transition system induced by the problem task. This suggests that bidirectional search is inherently better suited for certain problems. Furthermore, we find that this aptitude for a certain search direction correlates with the domain, providing a powerful analytic tool to derive a priori the effectiveness of certain search approaches.

In conclusion, even without intricate improvements, the NBS algorithm is able to compete with A* and therefore has further potential for future research. Additionally, the underlying transition system of a problem instance is shown to be an important factor influencing the efficiency of certain search approaches. This knowledge could be valuable for devising portfolio planners.

Multiple Sequence Alignment (MSA) is the problem of aligning multiple biological sequences in the evolutionarily most plausible way. It can be viewed as a shortest path problem through an n-dimensional lattice. Because of its large branching factor of 2^n − 1, it has found broad attention in the artificial intelligence community. Finding a globally optimal solution for more than a few sequences requires sophisticated heuristics and bounding techniques to solve the problem in acceptable time and within memory limits. In this thesis, we show how existing heuristics fall into the category of combining certain pattern databases. We combine arbitrary pattern collections that can be used as heuristic estimates and apply cost partitioning techniques from classical planning to MSA. We implement two of these heuristics for MSA and compare their estimates to the existing heuristics.

Increasing Cost Tree Search is a promising approach to multi-agent pathfinding problems, but like all approaches it has to deal with a huge number of possible joint paths, growing exponentially with the number of agents. We explore the possibility of reducing this number by introducing a value abstraction to the Multi-valued Decision Diagrams used to represent sets of joint paths. To that end, we introduce a heat map to heuristically judge how collision-prone agent positions are, and present how to use and possibly refine abstract positions in order to still find valid paths.

Estimating cheapest plan costs with the help of network flows is an established technique. Plans and network flows are very similar; however, network flows can differ from plans in the presence of cycles: if a transition system contains cycles, flows might be composed of multiple disconnected parts. This discrepancy can worsen the cheapest plan estimate. One idea for getting rid of the cycles is to introduce time steps: for every time step, the states of a transition system are copied, and the transitions are changed so that they connect states only with states of the next time step, which ensures that there are no cycles. It turns out that by applying this idea to multiple transition systems, the network flows of the individual transition systems can be synchronized via the time steps, yielding a new kind of heuristic that is also discussed in this thesis.

Probabilistic planning is a research field that became popular in the early 1990s. It aims at finding an optimal policy that maximizes the outcome of applying actions to states in an environment featuring unpredictable events. Such environments can consist of a large number of states and actions, which makes finding an optimal policy intractable with classical methods. Using a heuristic function for a guided search allows tackling such problems. Designing a domain-independent heuristic function requires complex algorithms which may be expensive in time and memory consumption.

In this thesis, we apply supervised learning techniques to learn two domain-independent heuristic functions. We use three types of gradient descent, stochastic, batch, and mini-batch gradient descent, as well as improved versions using momentum, learning rate decay, and early stopping. Furthermore, we apply feature combination in order to better learn the heuristic functions. The learned functions are provided to Prost, a domain-independent probabilistic planner, and benchmarked against the winning algorithms of the International Probabilistic Planning Competition 2014. The experiments show that learning an offline heuristic improves the overall score of the search in some of the domains used in the aforementioned competition.
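
As an illustration of the training setup, the following sketch fits a linear heuristic with mini-batch gradient descent and momentum using NumPy. The feature representation and hyperparameters are placeholders, not those of the thesis:

```python
import numpy as np

def train_linear_heuristic(features, targets, lr=0.01, momentum=0.9,
                           batch_size=32, epochs=100):
    """Fit h(s) = w·f(s) to sampled heuristic targets with mini-batch
    gradient descent plus momentum.

    features: array of shape (num_samples, num_features),
    targets: array of sampled heuristic values, shape (num_samples,).
    """
    rng = np.random.default_rng(0)
    w = np.zeros(features.shape[1])
    velocity = np.zeros_like(w)
    for _ in range(epochs):
        order = rng.permutation(len(features))
        for start in range(0, len(features), batch_size):
            batch = order[start:start + batch_size]
            x, y = features[batch], targets[batch]
            grad = 2 * x.T @ (x @ w - y) / len(batch)  # gradient of MSE
            velocity = momentum * velocity - lr * grad
            w += velocity
    return w
```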

The merge-and-shrink heuristic is a state-of-the-art admissible heuristic that is often used for optimal planning. Recent studies showed that the merge strategy is an important factor for the performance of the merge-and-shrink algorithm. Many different merge strategies and improvements to merge strategies are described in the literature. One of these merge strategies is MIASM by Fan et al., which tries to merge transition systems that produce unnecessary states in their product which can be pruned. Another is the symmetry-based merge-and-shrink framework by Sievers et al., which tries to merge transition systems that cause factored symmetries in their product; it can be combined with other merge strategies and often improves their performance. However, the current combination of MIASM with factored symmetries performs worse than MIASM. We implement a different combination that uses factored symmetries during the subset search of MIASM. Our experimental evaluation shows that our new combination solves more tasks than the existing MIASM and the previously implemented combination of MIASM with factored symmetries. We also evaluate combinations of existing merge strategies that had not been evaluated before and find combinations that perform better than their basic versions.

Tree Cache is a pathfinding algorithm that selects one vertex as a root and constructs a tree of cheapest paths to all other vertices. A path is found by traversing up the tree from both the start and goal vertices to the root and concatenating the two parts. This is fast, but since all paths constructed this way pass through the root vertex, they can be highly suboptimal.

To improve this algorithm, we consider two simple approaches. The first is to construct multiple trees and store the distance to each root in each vertex; to find a path, the algorithm then selects the root with the lowest total distance. The second is to remove redundant vertices, i.e., vertices between the root and the lowest common ancestor (LCA) of the start and goal vertices. The performance and space requirements of the resulting algorithm are then compared to the conceptually similar hub labels and differential heuristics.
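
A sketch of the resulting path construction with the LCA-based removal of redundant vertices; the parent/depth data layout is an assumption for illustration:

```python
def tree_path(parent, depth, start, goal):
    """Concatenate the start-to-root and goal-to-root paths, cutting
    them off at the lowest common ancestor so that no redundant
    vertices remain.

    parent: child -> parent in the shortest-path tree,
    depth: vertex -> depth in that tree.
    """
    up, down = [start], [goal]
    a, b = start, goal
    while a != b:  # walk the deeper endpoint up until both meet
        if depth[a] >= depth[b]:
            a = parent[a]
            up.append(a)
        else:
            b = parent[b]
            down.append(b)
    return up + down[::-1][1:]  # the LCA appears only once
```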

Greedy Best-First Search (GBFS) is a prominent search algorithm to find solutions for planning tasks. GBFS chooses nodes for further expansion based on a distance-to-goal estimator, the heuristic, which makes it highly dependent on heuristic quality. Heuristics often face the problem of producing Uninformed Heuristic Regions (UHRs), and GBFS additionally suffers from the possibility of simultaneously expanding nodes in multiple UHRs. In this thesis, we change the search behaviour inside UHRs: since the heuristic is unable to guide the search there, we instead expand novel states to escape the UHRs, where novelty measures how "new" a state is in the search. The result is a combination of heuristic- and novelty-guided search, which is indeed able to escape UHRs quicker and solve more problems in reasonable time.
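
For the simplest (arity-1) case, the novelty test reduces to a few lines, assuming states are represented as sets of facts:

```python
def novelty(state, seen_facts):
    """Width-1 novelty test: a state is novel iff it contains a fact
    not seen in any previously generated state.

    state: set of facts; seen_facts: set mutated across the search.
    """
    new_facts = state - seen_facts
    seen_facts |= new_facts
    return bool(new_facts)
```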

In classical AI planning, the state explosion problem is a recurring subject: although problem descriptions are compact, a huge number of states often needs to be considered. One way to tackle this problem is to use static pruning methods, which reduce the number of variables and operators in the problem description before planning.

In this work, we discuss the properties and limitations of three existing static pruning techniques with a focus on satisficing planning. We analyse these pruning techniques and their combinations, identifying synergy effects between them and the domains and problem structures in which these effects occur. We implement the three methods in an existing propositional planner and evaluate the performance of different configurations and combinations in a set of experiments on IPC benchmarks. We observe that static pruning techniques can increase the number of solved problems, and that the synergy effects of the combinations also occur on IPC benchmarks, although they do not lead to a major performance increase.

The goal of classical domain-independent planning is to find a sequence of actions leading from a given initial state to a goal state that satisfies some goal criteria. Most planning systems use heuristic search algorithms to find such a sequence. A critical component of heuristic search is the heuristic function; to find a plan efficiently, it has to guide the search towards the goal, and creating such an efficient heuristic function is difficult. Arfaee et al. show that it is possible to improve a given heuristic function by applying machine learning techniques on a single domain in the context of heuristic search. To achieve this improvement, they propose a bootstrap learning approach that iteratively improves the heuristic function.

In this thesis, we introduce a technique to learn heuristic functions for classical domain-independent planning based on the bootstrap-learning approach by Arfaee et al. To evaluate the performance of the learned heuristic functions, we implemented a learning algorithm for the Fast Downward planning system. The experiments show that a learned heuristic function generally decreases the number of explored states compared to blind search, but the total time to solve a single problem increases because the heuristic function has to be learned before it can be applied.

The ability to solve benchmark problems is essential for estimating the performance of an algorithm in satisficing planning. However, published results cannot be compared directly, as they originate from different implementations and different machines. We implemented some of the most promising algorithms for greedy best-first search published in recent years and evaluated them on the same set of benchmarks. All algorithms are based on randomised search, localised search, or a combination of both. Our evaluation demonstrates the potential of these algorithms.

Heuristic search with admissible heuristics is the leading approach to cost-optimal, domain-independent planning. Pattern database heuristics, a type of abstraction heuristic, are state-of-the-art admissible heuristics. Two recent pattern database heuristics are the iPDB heuristic by Haslum et al. and the PhO heuristic by Pommerening et al.

The iPDB procedure performs a hill-climbing search in the space of pattern collections and evaluates selected patterns using the canonical heuristic. We apply different techniques to the iPDB procedure, improving its hill-climbing algorithm as well as the quality of the resulting heuristic. The second recent heuristic, the PhO heuristic, obtains strong heuristic values through linear programming. We present different techniques to influence and improve the PhO heuristic.

We evaluate the modified iPDB and PhO heuristics on the IPC benchmark suite and show that these abstraction heuristics can compete with other state-of-the-art heuristics in cost-optimal, domain-independent planning.

Greedy best-first search (GBFS) is a prominent search algorithm for satisficing planning: finding good-enough solutions to a planning task in reasonable time. GBFS selects the next node to consider based on the most promising node as estimated by a heuristic function. This behaviour makes GBFS heavily dependent on the quality of the heuristic estimator: inaccurate heuristics can lead GBFS into regions far away from a goal, and if the heuristic ranks several nodes the same, GBFS has no information about which node to follow. Diverse best-first search (DBFS) is an algorithm by Imai and Kishimoto [2011] with a local search component to emphasize exploitation; to enable exploration, DBFS selects the next node probabilistically.

For two problem domains, we analyse GBFS's search behaviour and present theoretical results. We evaluate these results empirically and compare DBFS and GBFS on constructed as well as provided problem instances.

State-of-the-art planning systems use a variety of control knowledge in order to enhance the performance of heuristic search. Unfortunately, most forms of control knowledge use a specific formalism, which makes them hard to combine. Several approaches describe control knowledge in Linear Temporal Logic (LTL). We build upon this work and propose a general framework for encoding control knowledge in LTL formulas. The framework includes a criterion that any LTL formula used within it must fulfill in order to preserve optimal plans when used for pruning the search space; this way, the validity of new LTL formulas describing control knowledge can be checked. The framework is implemented on top of the Fast Downward planning system and is tested with a pruning technique called Unnecessary Action Application, which detects whether a previously applied action achieved no useful progress.

Landmarks are known to yield powerful heuristics for informed search. In this thesis, we explain and evaluate a novel algorithm that finds ordered landmarks of delete-free tasks by intersecting solutions in the relaxation. The proposed algorithm efficiently finds landmarks and natural orderings of delete-free tasks, such as delete relaxations or Pi-m compilations.

Planning as heuristic search is the prevalent technique for solving planning problems of all kinds of domains. Heuristics estimate distances to goal states in order to guide a search through large state spaces. However, this guidance is sometimes moderate, since many states lie on plateaus of equally prioritized states in the search space topology. Additional techniques that ignore or prefer some actions for solving a problem successfully support the search in such situations. Nevertheless, some action pruning techniques lead to incomplete searches.

We propose an under-approximation refinement framework for adding actions to under-approximations of planning tasks during a search in order to find a plan. For this framework, we develop a refinement strategy. Starting a search on an initial under-approximation of a planning task, the strategy adds actions determined at states close to a goal, whenever the search does not progress towards a goal, until a plan is found. Key elements of this strategy consider helpful actions and relaxed plans for refinements. We have implemented the under-approximation refinement framework in the greedy best-first search algorithm. Our results show considerable speedups for many classical planning problems. Moreover, we are able to plan with fewer actions than standard greedy best-first search.

The main approach to classical planning is heuristic search. Many cost heuristics are based on the delete relaxation. The optimal heuristic of a delete-free planning problem is called h+. This thesis explores two new ways to compute h+. Both approaches use factored planning, which decomposes the original planning problem in order to work on each subproblem separately. The algorithm reuses the subsolutions and combines them into a global solution.

Both algorithms are used to compute a cost heuristic for an A* search. As both approaches compute the optimal heuristic for delete-free planning tasks, they can also be used to find solutions for relaxed planning tasks.

Multi-Agent Path Finding (MAPF) is a common problem in robotics and memory management. Pebbles in Motion is an implementation of a polynomial-time problem solver for MAPF, based on work by Daniel Kornhauser from 1984. Recently, many research papers on MAPF have been published in the Artificial Intelligence community, but Kornhauser's work is hardly ever taken into account. We assume this may be because said paper is rather mathematical and hardly describes the algorithms intuitively. This work aims to fill this gap by providing easily understandable implementation steps for programmers and a new detailed description for researchers in computer science.

Bachelor's theses

Fast Downward is a classical planner using heuristic search. The planner uses many advanced planning techniques that are not easy to teach, since they usually rely on complex data structures. To introduce planning techniques to the user, we created an interactive application that uses an illustrative example to showcase them: Blocksworld.

Blocksworld is an easily understandable planning problem which allows a simple representation of a state space. It is implemented in the Unreal Engine and provides an interface to the Fast Downward planner. Users can explore a state space themselves or have Fast Downward generate plans for them. The concept of heuristics as well as the state space are explained and made accessible to the user. The user experiences how the planner explores a state space and which techniques it uses.

This thesis is about implementing Jussi Rintanen’s algorithm for schematic invariants. The algorithm is implemented in the planning tool Fast Downward and refers to Rintanen’s paper Schematic Invariants by Reduction to Ground Invariants. The thesis describes all necessary definitions to understand the algorithm and draws a comparison between the original task and a reduced task in terms of runtime and number of grounded actions.

Planning is a field of Artificial Intelligence. Planners are used to find a sequence of actions that gets from the initial state to a goal state. Many planning algorithms use heuristics, which allow the planner to focus on more promising paths. Pattern database heuristics construct such a heuristic by solving a simplified version of the problem and saving the associated costs in a pattern database. These pattern databases can be computed and stored using symbolic data structures.
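
For reference, the explicit construction that the symbolic variant replaces can be sketched as a backward breadth-first search over the abstract state space (unit costs assumed; interface names are illustrative):

```python
from collections import deque

def build_pdb(abstract_states, goal_states, predecessors):
    """Backward breadth-first search over the abstract state space,
    recording the exact goal distance of every abstract state.

    predecessors: function from an abstract state to its predecessor
    states in the abstract transition system.
    """
    distance = {s: 0 for s in goal_states}
    queue = deque(goal_states)
    while queue:
        state = queue.popleft()
        for pred in predecessors(state):
            if pred not in distance:
                distance[pred] = distance[state] + 1
                queue.append(pred)
    # Abstract states never reached are dead ends.
    return {s: distance.get(s, float("inf")) for s in abstract_states}
```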

In this paper, we look at how pattern databases can be implemented with symbolic data structures, using binary decision diagrams and algebraic decision diagrams. We extend Fast Downward (Helmert, 2006) with this implementation and compare its performance with the already implemented explicit pattern databases.

In the field of automated planning and scheduling, a planning task is essentially a state space that can be defined rigorously using one of several formalisms (e.g., STRIPS, SAS+, or PDDL). A planning algorithm tries to determine a sequence of actions that leads to a goal state for a given planning task. In recent years, attempts have been made to group certain planners together into so-called planner portfolios to leverage their effectiveness on different specific problem classes. In our project, we create an online planner which, in contrast to its offline counterparts, makes use of task-specific information when allocating a planner to a task. One idea that has recently gained interest is to apply machine learning methods to planner portfolios.

In previous work such as Delfi (Katz et al., 2018; Sievers et al., 2019a), supervised learning techniques were used, which made it necessary to train multiple networks to be able to attempt multiple, potentially different, planners for a given task: if the same network were used, the output would always be the same, as the input to the network would remain unchanged. In this project, we make use of techniques from reinforcement learning such as DQNs (Mnih et al., 2013). Using RL approaches such as DQNs allows us to extend the network input with information such as which planners were previously attempted and for how long. As a result, multiple attempts can be made after training only a single network.

Unfortunately, the results show that current reinforcement learning agents are, among other reasons, too sample-inefficient to deliver viable results given the size of the currently available data sets.

Planning tasks are important and difficult problems in computer science. A widely used approach is the use of delete relaxation heuristics, to which the additive and the FF heuristic belong. These two heuristics use a graph in their calculation which only has to be constructed once per planning task but can then be used repeatedly. To solve such problems efficiently, it is important that the heuristics can be calculated quickly. In this thesis, the idea for achieving a faster calculation is to merge redundant parts of the graph during construction, reducing the number of edges and therefore speeding up the calculation. The reduction of redundancies is done for each action of a planning task individually, but further ideas for simplifying across all actions are also discussed.

Monte Carlo search methods are widely known, mostly for their success in game domains, although they are also applied to many non-game domains. In previous work by Schulte and Keller, it was established that best-first search can be adapted to the action selection functionality which makes Monte Carlo methods so formidable. In practice, however, trial-based best-first search without exploration was shown to be slightly slower than its explicit open list counterpart. In this thesis, we examine non-trial and trial-based searches and how they can address the exploration-exploitation dilemma. Lastly, we investigate how trial-based BFS can rectify a slower search by allowing occasional random action selection, comparing it to regular open list searches in a line of experiments.

Sudoku has become one of the world’s most popular logic puzzles, arousing interest in the general public and the scientific community. Although the rules of Sudoku may seem simple, they allow for nearly countless puzzle instances, some of which are very hard to solve. SAT solvers have proven to be a suitable option for solving Sudokus automatically, but they require the puzzles to be encoded as logical formulas in conjunctive normal form. In earlier work, such encodings have been successfully demonstrated for original Sudoku puzzles. In this thesis, we present encodings for rather unconventional Sudoku variants, developed by the puzzle community to create even more challenging solving experiences. Furthermore, we demonstrate how pseudo-Boolean constraints can be utilized to encode Sudoku variants whose rules involve sums. To implement an encoding of pseudo-Boolean constraints, we use Binary Decision Diagrams and adder networks and study how they compare to each other.
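
A minimal sketch of such a CNF encoding for the classic rules, using the PySAT library. The variable numbering is one common convention, not necessarily the one used in the thesis:

```python
from itertools import combinations
from pysat.solvers import Glucose3

def var(r, c, d):
    """Variable for 'digit d is in cell (r, c)', all indices in 1..9."""
    return 81 * (r - 1) + 9 * (c - 1) + d

def sudoku_clauses():
    """CNF for the classic rules: each cell holds exactly one digit,
    and no digit repeats in any row, column, or 3x3 box."""
    clauses = []
    for r in range(1, 10):
        for c in range(1, 10):
            clauses.append([var(r, c, d) for d in range(1, 10)])
            for d1, d2 in combinations(range(1, 10), 2):
                clauses.append([-var(r, c, d1), -var(r, c, d2)])
    groups = [[(r, c) for c in range(1, 10)] for r in range(1, 10)]
    groups += [[(r, c) for r in range(1, 10)] for c in range(1, 10)]
    groups += [[(3 * br + r, 3 * bc + c) for r in range(1, 4) for c in range(1, 4)]
               for br in range(3) for bc in range(3)]
    for cells in groups:
        for d in range(1, 10):
            for (r1, c1), (r2, c2) in combinations(cells, 2):
                clauses.append([-var(r1, c1, d), -var(r2, c2, d)])
    return clauses

# Clues enter as assumptions; here a single hypothetical clue.
with Glucose3(bootstrap_with=sudoku_clauses()) as solver:
    print(solver.solve(assumptions=[var(1, 1, 5)]))  # True: satisfiable
```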

In optimal classical planning, informed search algorithms like A* need admissible heuristics to find optimal solutions. Counterexample-guided abstraction refinement (CEGAR) is a method used to iteratively generate abstractions that yield suitable abstraction heuristics. In this thesis, we propose a class of CEGAR algorithms for generating domain abstractions, a class of abstractions that ranks between projections and Cartesian abstractions in the degree of refinement they allow. As no previous algorithm constructs domain abstractions, we show that our algorithm is competitive with CEGAR algorithms that generate projections or Cartesian abstractions.

This thesis looks at Single-Player Chess as a planning domain using two approaches: one where we encode the Single-Player Chess problem for a domain-independent (general-purpose AI) planner, and one where we build a domain-specific solver. We compare the two approaches by running experiments and comparing their results. Both the domain-independent and the domain-specific implementation differ from traditional chess engines because the agent's task is not to find the best move for a given position and colour, but to check whether a given chess problem has a solution; if the agent can find a solution, the given chess puzzle is valid. The results of both approaches were measured in experiments, and we found that the domain-independent implementation is too slow, while the domain-specific implementation can solve the given puzzles reliably but has a memory bottleneck rooted in the search method used.

Carcassonne is a tile-based board game with a large state space and a high branching factor and therefore poses a challenge to artificial intelligence. In the past, Monte Carlo Tree Search (MCTS), a search algorithm for sequential decision-making processes, has been shown to find good solutions in large state spaces. MCTS works by iteratively building a game tree according to a tree policy. The profitability of paths within that tree is evaluated using a default policy, which influences in what directions the game tree is expanded. The functionality of these two policies, as well as other factors, can be implemented in many different ways, so many different variants of MCTS exist. In this thesis, we apply MCTS to the domain of two-player Carcassonne and evaluate different variants with regard to their performance and runtime. We found significant performance differences between the variable aspects of MCTS and could thereby identify a configuration which performs best on the domain of Carcassonne. This variant consistently outperformed an average human player with a feasible runtime.

In general, it is important to verify software, as it is prone to error. This also holds for solvers in classical planning. So far, plans as well as the fact that there is no plan for a given planning task can be proven and independently verified; however, no such proof exists for the optimality of a solution. Our aim is to introduce two methods with which optimality can be proven and independently verified. We first reduce unit-cost tasks to unsolvable tasks, which enables us to make use of the already existing certificates for unsolvability. In a second approach, we propose a proof system for optimality, which enables us to infer that the determined cost of a task is optimal and permits the direct generation of optimality certificates.

Pattern databases are among the most powerful heuristics in classical planning. They evaluate the perfect cost of a simplified sub-problem. The post-hoc optimization heuristic is a technique for optimally combining a set of pattern databases. In this thesis, we adapt the post-hoc optimization heuristic to the sliding tile puzzle, which serves as a benchmark for comparing it to established methods that also combine pattern databases. We then show how the post-hoc optimization heuristic improves over these established methods.

In this thesis, we generate landmarks for a logistics-specific task. Landmarks are actions that need to occur at least once in every plan. A landmark graph is a structure of landmarks and their edges, called orderings. If a landmark graph contains cycles, then for every cycle one of its landmarks needs to be achieved at least twice. The generated logistics-specific landmarks and their orderings serve to compute the cyclic landmark heuristic. The task picks up on related work: the evaluation of the cyclic landmark heuristic. We compare the landmark graphs generated by a domain-independent landmark generator with those of a domain-specific landmark generator, the latter being the focus, aiming to bridge the gap between domain-specific and domain-independent landmark generators. Concretely, we compare one domain-specific approach for the logistics domain with results from a domain-independent landmark generator, and we devise a unit to pre-process data for other domain-specific tasks as well. We show that, here, specificity is better suited than independence.

Linear programming is a mathematical modeling technique in which a linear function is to be maximized or minimized subject to various constraints. This technique is especially useful when decisions need to be made for optimization problems. The goal of this work was to develop a tool for the game Factory Town with which optimization queries can be processed. It is possible to choose between various problem formulations and to answer them using LP and IP solvers. In addition, the mathematical formulations as well as the differences between the two methods are addressed. Finally, the generated results underline that LP solutions are at least as good as, or even better than, the solutions of an IP.

Symbolic search is an important approach to classical planning. Symbolic search algorithms process sets of states at a time, which requires states to be represented by compact data structures called knowledge compilations. Merge-and-shrink representations come from a different area of planning, where they have been used to derive heuristic functions for state-space search. More generally, they represent functions that map variable assignments to a set of values, so we can regard them as a data structure we call Factored Mappings. In this thesis, we investigate Factored Mappings (FMs) as a knowledge compilation language with the goal of using them for symbolic search. We analyse the necessary transformations and queries for FMs by defining the needed operations and a canonical representation of FMs, and show that they run in polynomial time. We then show that it is possible to use Factored Mappings as a knowledge compilation for symbolic search by defining a symbolic search algorithm for finite-domain planning tasks that works with FMs.

Version control systems use a graph data structure to track revisions of files. Those graphs are mutated with various commands by the respective version control system. The goal of this thesis is to formally define a model of a subset of Git commands which mutate the revision graph, and to model those mutations as a planning task in the Planning Domain Definition Language. Multiple ways to model those graphs are explored, and the resulting models are compared by testing them with a set of planners.

Pattern databases are admissible abstraction heuristics for classical planning. In this thesis, we introduce a boosting process which consists of enlarging the pattern of a pattern database P, calculating a more informed pattern database P', and then min-compressing P' to the size of P, resulting in a compressed and still admissible pattern database P''. We design and implement two boosting algorithms, Hillclimbing and Randomwalk.

We combine pattern database heuristics using five different cost partitioning methods. The experiments compare cost partitionings computed over regular and boosted pattern databases. The experiments, performed on IPC (optimal track) tasks, show promising results: with our Randomwalk boosting variant, coverage (the number of solved tasks) increases by 9 for canonical cost partitioning.

One-dimensional potential heuristics assign a numerical value, the potential, to each fact of a classical planning problem. The heuristic value of a state is the sum over the potentials belonging to the facts contained in the state. Fišer et al. (2020) recently proposed to strengthen potential heuristics utilizing mutexes and disambiguations. In this thesis, we embed the same enhancements in the planning system Fast Downward. The experimental evaluation shows that the strengthened potential heuristics are a refinement, but too computationally expensive to solve more problems than the non-strengthened potential heuristics.
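
Evaluating such a heuristic is a simple sum; the sketch below only illustrates the definition above, with an assumed fact representation (finding good potentials is the actual problem addressed here):

```python
def potential_heuristic(weights, state):
    """Sum of the potentials of the facts that hold in the state.

    weights: dict mapping facts (variable, value) to potentials,
    state: dict mapping variables to their current values.
    """
    return sum(weights.get((var, val), 0.0) for var, val in state.items())
```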

The potentials are obtained with a linear program. Fišer et al. (2020) introduced an additional constraint on the initial state, and we propose additional constraints on random states. The additional constraints improve the number of solved problems by up to 5%.

This thesis discusses the PINCH heuristic, a specific implementation of the additive heuristic that intends to combine the strengths of existing implementations. The goal of this thesis is to examine the PINCH heuristic in depth: to provide the most accessible resource for understanding it and to analyze its performance by comparing it to the algorithm on which it is based, Generalized Dijkstra.

Suboptimal search algorithms can offer attractive benefits compared to optimal search, namely increased coverage of larger search problems and quicker search times. Improving such algorithms, for example by reducing costs further towards optimal solutions or reducing the number of node expansions, is therefore a compelling area for further research. This paper explores the utility and scalability of the recently developed priority functions XDP, XUP, and PWXDP, and of the Improved Optimistic Search algorithm, compared to Weighted A*, in the Fast Downward planner. Analyses focus on cost, total time, coverage, and node expansions, with experimental evidence suggesting preferable performance if strict optimality is not required. The implementation of priority functions in eager best-first search showed marked improvements over A* search in coverage, total time, and number of expansions, without significant cost penalties. In line with previous research on suboptimal search, the experimental evidence even indicates that these cost penalties do not reach the designated bound, even in larger search spaces.

In the Automated Planning field, algorithms and systems are developed for exploring state spaces and ultimately finding an action sequence leading from a task’s initial state to its goal. Such planning systems may sometimes show unexpected behavior, caused by a planning task or a bug in the planner itself. Generally speaking, finding the source of a bug tends to be easier when the cause can be isolated or simplified. In this thesis, we tackle this problem by making PDDL and SAS+ tasks smaller while ensuring that they still invoke a certain characteristic when executed with a planner. We implement a system that successively removes elements, such as objects, from a task and checks whether the transformed task still fails on the planner. Elements are removed in a syntactically consistent way; however, no semantic integrity is enforced. Our system’s design is centered around the Fast Downward planning system, as we re-use some of its translator modules, and all test runs are performed with Fast Downward. At the core of our system, first-choice hill-climbing is used for optimization. Our “minimizer” takes (1) a failing planner execution command, (2) a description of the failing characteristic, and (3) the type of element to be deleted as arguments. We evaluate our system’s functionality on the basis of three use cases. In our most successful test runs, (1) a SAS+ task with initially 1536 operators and 184 variables is reduced to 2 operators and 2 variables, and (2) a PDDL task with initially 46 actions, 62 objects, and 29 predicate symbols is reduced to 2 actions, 6 objects, and 4 predicates.

Fast Downward is a classical planning system based on heuristic search. Its successor generator is an efficient tool for processing state spaces and generating successor states. In this thesis, we implement different successor generators in the Fast Downward planning system and compare them against each other. Apart from the existing Fast Downward successor generator, we implement four others: a naive successor generator, one based on the marking of delete-relaxed heuristics, one based on the PSVN planning system, and one based on watched literals as used in modern SAT solvers. These successor generators are tested on a variety of planning benchmarks to see how well they compete against each other. We verified that there is a trade-off between precomputation and faster successor generation and showed that each of the implemented successor generators has a use case, making it advisable to switch to a successor generator that fits the style of the planning task.
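
The naive baseline can be sketched in a few lines; the operator interface is an illustrative assumption, not Fast Downward's actual data structures:

```python
def naive_successors(state, operators):
    """Naive successor generation: test every operator's preconditions
    against the state and apply the effects of the applicable ones.

    state: dict variable -> value; each operator is assumed to expose
    `preconditions` and `effects` dicts. The smarter generators above
    avoid iterating over all operators for every state.
    """
    successors = []
    for op in operators:
        if all(state.get(v) == val for v, val in op.preconditions.items()):
            succ = dict(state)
            succ.update(op.effects)
            successors.append((op, succ))
    return successors
```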

Verifying that a planning algorithm came to the correct result for a given planning task is easy if a plan is emitted which solves the problem. But if a task is unsolvable, most planners simply state this fact without any explanation or proof. In this thesis, we present extended versions of the symbolic search algorithms SymPA and symbolic bidirectional uniform-cost search which, if a given planning task is unsolvable, provide certificates that prove unsolvability. We also discuss a concrete implementation of this version of SymPA.

Classical planning is an attractive approach to solving problems because of its generality and its relative ease of use. Domain-specific algorithms are appealing because of their performance but require a lot of resources to be implemented. In this thesis, we evaluate concept languages as a possible input language for expert domain knowledge in a planning system. We also explore mixed integer programming as a way to use this knowledge to improve search efficiency and to help the user find and refine useful domain knowledge.

Classical planning is a branch of artificial intelligence that studies single-agent, static, deterministic, fully observable, discrete search problems. A common challenge in this field is the explosion of the number of states to be considered when searching for the goal. One technique developed to mitigate this is Strong Stubborn Set based pruning, where on each state expansion the considered successors are restricted to Strong Stubborn Sets, which exploit the properties of independent operators to cut down the tree or graph search. We adapt the definitions of the theory of Strong Stubborn Sets from the SAS+ setting to transition systems and validate a central theorem about the correctness of Strong Stubborn Set based pruning for transition systems in the interactive theorem prover Isabelle/HOL.

Planning problems are an important field in artificial intelligence research. The goal is to build an artificially intelligent machine that can handle many different problems and solve them reliably by producing an optimal plan.

Trial-based Heuristic Tree Search (THTS) is a powerful tool for solving multi-armed-bandit-like problems: Markov decision processes with changing rewards. In current THTS, good rewards found through exploration can go unnoticed due to the large number of rewards, and likewise, bad rewards encountered during exploration can degrade good nodes in the search tree. This thesis introduces a method, originating from the piecewise-stationary multi-armed bandit setting, to further optimize THTS.

Abstractions are a simple yet powerful method of creating heuristics to solve classical planning problems optimally. In this thesis, we make use of Cartesian abstractions generated with counterexample-guided abstraction refinement (CEGAR). This method refines abstractions incrementally by finding flaws and resolving them until the abstraction is sufficiently evolved. The goal of this thesis is to implement and evaluate algorithms which choose among the possible resolutions of such flaws in a way which results in the best abstraction (that is, the abstraction which causes the problem to be solved most efficiently by the planner). We measure the performance of a refinement strategy by running the Fast Downward planner on a problem and measuring how long it takes to generate the abstraction, as well as how many expansions the planner requires to find a goal using the abstraction as a heuristic. We use a suite of benchmark problems for evaluation and perform this experiment for a single abstraction as well as for abstractions of multiple subtasks. Finally, we attempt to predict which refinement strategy should be used based on parameters of the task, potentially allowing the planner to select the best strategy automatically at runtime.

Heuristic search is a powerful paradigm in classical planning. The information generated by heuristic functions to guide the search towards a goal is a key component of many modern search algorithms. The paper "Using Backwards Generated Goals for Heuristic Planning" by Alcázar et al. proposes a way to make additional use of this information: they take the last actions of a relaxed plan as a basis for generating intermediate goals with a known path to the original goal. A plan is found when the forward search reaches an intermediate goal.

The premise of this thesis is to modify their approach by focusing on a single sequence of intermediate goals. The aim is to improve efficiency while preserving the benefits of backwards goal expansion. We propose different variations of our approach by introducing multiple ways to make decisions concerning the construction of intermediate goals. We evaluate these variations by comparing their performance and illustrate the challenges posed by this approach.

Counterexample-guided abstraction refinement (CEGAR) is a way to incrementally compute abstractions of transition systems. It starts with a coarse abstraction and then iteratively finds an abstract plan, checks where the plan fails in the concrete transition system and refines the abstraction such that the same failure cannot happen in subsequent iterations. As the abstraction grows in size, finding a solution for the abstract system becomes more and more costly. Because the abstraction grows incrementally, however, it is possible to maintain heuristic information about the abstract state space, allowing the use of informed search algorithms like A*. As the quality of the heuristic is crucial to the performance of informed search, the method for maintaining the heuristic has a significant impact on the performance of the abstraction refinement as a whole. In this thesis, we investigate different methods for maintaining the value of the perfect heuristic h* at all times and evaluate their performance.

Pattern databases are a powerful class of abstraction heuristics which provide admissible path cost estimates by computing exact solution costs for all states of a smaller task. Said task is obtained by abstracting away variables of the original problem. Abstractions with few variables offer weak estimates, while each additional variable at least doubles the memory needed for the pattern database. In this thesis, we present a class of algorithms based on counterexample-guided abstraction refinement (CEGAR) which exploit additivity relations between patterns to produce pattern collections from which we can derive heuristics that are both informative and computationally tractable. We show that our algorithms are competitive with existing pattern generators by comparing their performance on a variety of planning tasks.

We consider the problem of Rubik's Cube to evaluate modern abstraction heuristics. In order to find feasible abstractions of the enormous state space spanned by Rubik's Cube, we apply projection in the form of pattern databases, Cartesian abstraction via counterexample-guided abstraction refinement, as well as merge-and-shrink strategies. While previous publications on Cartesian abstractions have not covered applicability to planning tasks with conditional effects, we introduce factorized effect tasks and show that Cartesian abstraction can be applied to them. To evaluate the performance of the chosen heuristics, we run experiments on different problem instances of Rubik's Cube. We compare them by the initial h-value found for all problems and analyze the number of expanded states up to the last f-layer. These criteria provide insights about the informativeness of the considered heuristics. Cartesian abstraction yields perfect heuristic values for problem instances close to the goal; however, it is outperformed by pattern databases for more complex instances. Even though merge-and-shrink is the most general abstraction among those considered, it does not show better performance than the others.

Probabilistic planning expands on classical planning by attaching probabilities to the effects of actions. Due to the exponential size of the state space, probabilistic planners have to come up with a strong policy in very limited time. One approach to optimising the policy that can be found in the available time is called metareasoning: a technique that aims to allocate more deliberation time to steps where additional planning time improves the policy, and less deliberation time to steps where an improvement with more planning time is unlikely.

This thesis aims to adapt a recent proposal of a formal metareasoning procedure by Lin et al. for the search algorithm BRTDP to work with the UCT algorithm in the Prost planner, and to compare its viability against the current standard and a number of less informed time-management methods, in order to find a potential improvement over the current uniform distribution of deliberation time.

A planner tries to produce a policy that leads to a desired goal, given an initial state and the available range of actions. A traditional approach is to use abstraction. In this thesis we implement the algorithm described in the ASAP-UCT paper: "Abstraction of State-Action Pairs in UCT" by Ankit Anand, Aditya Grover, Mausam and Parag Singla.

The algorithm combines state and state-action abstraction with a UCT algorithm. We come to the conclusion that the algorithm needs to be improved, because the state-action abstraction often cannot detect a similarity that a reasonable action abstraction could find.

The notion of adding a form of exploration to guide a search has proven to be an effective method of combating heuristic plateaus and improving the performance of greedy best-first search. The goal of this thesis is to take the same approach and introduce exploration into bounded suboptimal search. Explicit estimation search (EES), established by Thayer and Ruml, consults potentially inadmissible information to determine the search order. Admissible heuristics are then used to guarantee the cost bound. In this work we replace the distance-to-go estimator used in EES with an approach based on the concept of novelty.

Classical domain-independent planning is about finding a sequence of actions which leads from an initial state to a goal state. A popular approach for solving planning problems efficiently is to utilize heuristic functions. One possible heuristic function is the perfect heuristic of the delete-relaxed planning problem, denoted h+. Delete relaxation simplifies the planning problem, making it easier to find a perfect heuristic. However, computing h+ is still an NP-hard problem.

In this thesis we discuss a promising approach to computing h+ in practice. Inspired by the paper by Gnad, Hoffmann and Domshlak about star-shaped planning problems, we implemented the Flow-Cut algorithm. The basic idea behind flow-cut is to divide a problem that is unsolvable in practice into smaller subproblems that can be solved. We tested the flow-cut algorithm on the domains provided by the International Planning Competition benchmarks, resulting in the following conclusion: a divide-and-conquer approach can successfully be used to solve classical planning problems, but it is not trivial to design such an algorithm to be more efficient than state-of-the-art search algorithms.

This thesis deals with the algorithm presented in the paper "Landmark-based Meta Best-First Search Algorithm: First Parallelization Attempt and Evaluation" by Simon Vernhes, Guillaume Infantes and Vincent Vidal. Their idea was to reconsider the approach to landmarks as a tool in automated planning, but in a markedly different way than previous work had done. Their result is a meta-search algorithm which explores landmark orderings to find a series of subproblems that reliably lead to an effective solution. Any complete planner may be used to solve the subproblems. While the referenced paper also deals with an attempt to effectively parallelize the Landmark-based Meta Best-First Search Algorithm, this thesis is concerned mainly with the sequential implementation and evaluation of the algorithm in the Fast Downward planning system.

Heuristics play an important role in classical planning. Using heuristics during state space search often reduces the time required to find a solution, but constructing heuristics and using them to calculate heuristic values takes time, reducing this benefit. Constructing heuristics and calculating heuristic values as quickly as possible is very important to the effectiveness of a heuristic. In this thesis we introduce methods to bound the construction of merge-and-shrink to reduce its construction time and increase its accuracy for small problems, and to bound the heuristic calculation of landmark cut to reduce heuristic value calculation time. To evaluate the performance of these depth-bound heuristics we have implemented them in the Fast Downward planning system together with three iterative-deepening heuristic search algorithms: iterative-deepening A* search, a new breadth-first iterative-deepening version of A* search, and iterative-deepening breadth-first heuristic search.
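
For reference, the iterative-deepening A* scheme named here follows a simple pattern: repeated depth-first probes with a growing f-bound. A generic textbook sketch on a toy graph (an assumption for illustration, not the Fast Downward implementation):

```python
def ida_star(start, goal, successors, h):
    """Textbook IDA*: depth-first probes with an increasing f-bound."""
    def probe(node, g, bound, path):
        f = g + h(node)
        if f > bound:
            return f, None                 # exceeded the bound; report f
        if node == goal:
            return f, list(path)
        minimum = float("inf")
        for child, cost in successors(node):
            if child in path:              # avoid cycles on the current path
                continue
            path.append(child)
            t, plan = probe(child, g + cost, bound, path)
            path.pop()
            if plan is not None:
                return t, plan
            minimum = min(minimum, t)
        return minimum, None

    bound, path = h(start), [start]
    while True:
        bound, plan = probe(start, 0, bound, path)
        if plan is not None or bound == float("inf"):
            return plan

graph = {"A": [("B", 1), ("C", 2)], "B": [("G", 3)], "C": [("G", 1)], "G": []}
h_vals = {"A": 2, "B": 1, "C": 1, "G": 0}
print(ida_star("A", "G", lambda n: graph[n], lambda n: h_vals[n]))  # A-C-G
```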

Greedy best-first search has proven to be a very efficient approach to satisficing planning, but can lose some of its effectiveness when the heuristic function misleads it into a local minimum or plateau. This is where exploration with additional open lists comes in, to assist greedy best-first search in solving satisficing planning tasks more effectively. Building on the idea of exploration by clustering similar states together as described by Xie et al. [2014], where states are clustered according to heuristic values, we propose in this paper to instead cluster states based on the Hamming distance of the binary representation of states [Hamming, 1950]. The resulting open list maintains k buckets and inserts each given state into the bucket with the smallest average Hamming distance between the already clustered states and the new state. Additionally, our open list is capable of periodically reclustering all states using the k-means algorithm. We achieved promising results concerning the number of expansions necessary to reach a goal state, despite not achieving higher coverage than fully random exploration due to slow performance, caused by the number of calculations required to identify the most fitting cluster when inserting a new state.
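
The insertion rule is easy to state in code. Below is a minimal sketch, assuming states are encoded as equal-length bit tuples; the periodic k-means reclustering described above is omitted:

```python
class HammingBucketOpenList:
    """k buckets; a new state joins the bucket whose members have the
    smallest average Hamming distance to it (empty buckets are filled first)."""

    def __init__(self, k):
        self.buckets = [[] for _ in range(k)]

    @staticmethod
    def hamming(a, b):
        # States are assumed to be equal-length bit tuples.
        return sum(x != y for x, y in zip(a, b))

    def insert(self, state):
        for bucket in self.buckets:
            if not bucket:                 # fill empty buckets first
                bucket.append(state)
                return
        best = min(self.buckets,
                   key=lambda b: sum(self.hamming(state, s) for s in b) / len(b))
        best.append(state)

ol = HammingBucketOpenList(2)
for s in [(0, 0, 0), (1, 1, 1), (0, 0, 1)]:
    ol.insert(s)
print(ol.buckets)   # (0, 0, 1) lands in the bucket next to (0, 0, 0)
```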

Monte Carlo Tree Search (MCTS) algorithms are an efficient method of solving probabilistic planning tasks that are modeled as Markov decision processes. MCTS uses two policies: a tree policy for iterating through the known part of the decision tree, and a default policy to simulate the actions and their reward after leaving the tree. MCTS algorithms have been applied with great success to computer Go. To make the two policies fast, many enhancements based on online knowledge have been developed. The goal of All Moves As First (AMAF) enhancements is to improve the quality of a reward estimate in the tree policy. In this thesis, the α-AMAF, Cutoff-AMAF and Rapid Action Value Estimation (RAVE) enhancements, which are very efficient in the field of computer Go, are implemented in the probabilistic planner PROST. To obtain a better default policy, Move Average Sampling is implemented in PROST and benchmarked against its current default policies.
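
As a sketch of how such enhancements combine statistics, the following shows the α-AMAF blend together with one published RAVE-style schedule for α (the schedule constant is an assumption for illustration, not necessarily the choice made in this thesis):

```python
import math

def alpha_amaf(q_estimate, amaf_estimate, alpha):
    """alpha-AMAF: mix the AMAF statistic into the regular value estimate."""
    return alpha * amaf_estimate + (1.0 - alpha) * q_estimate

def rave_alpha(visits, k=1000.0):
    """RAVE-style schedule: rely on AMAF early, fade it out as visits grow."""
    return math.sqrt(k / (3.0 * visits + k))

# Early on, the noisy but plentiful AMAF statistic dominates the estimate:
print(alpha_amaf(0.4, 0.7, rave_alpha(visits=10)))      # close to 0.7
print(alpha_amaf(0.4, 0.7, rave_alpha(visits=10000)))   # close to 0.4
```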

In classical planning the objective is to find a sequence of applicable actions that leads from the initial state to a goal state. In many cases the given problem can be of enormous size. To deal with these cases, a prominent method is heuristic search, which uses a heuristic function to evaluate states and can focus on the most promising ones. In addition to applying heuristics, the search algorithm can apply pruning techniques that exclude applicable actions in a state because applying them at a later point in the path would result in a path consisting of the same actions in a different order. The question remains as to how these actions can be selected without generating so much additional work that they stop being useful for the overall search. In this thesis we implement and evaluate the partition-based path pruning method proposed by Nissim et al. [1], which tries to decompose the set of all actions into partitions. Based on this decomposition, actions can be pruned with very little additional information. With some alterations to the A* search algorithm, the partition-based path pruning method is guaranteed to preserve its optimality. The evaluation confirms that in several standard planning domains, the pruning method can reduce the size of the explored state space.

Validating real-time systems is an important and complex task which becomes exponentially harder with increasing system size. Therefore, finding an automated approach to check real-time systems for possible errors is crucial. The behaviour of such real-time systems can be modelled with timed automata. This thesis adapts and implements the under-approximation refinement algorithm developed for search-based planners proposed by Heusner et al. to find error states in timed automata via the directed model checking approach. The evaluation compares the algorithm to existing search methods and shows that a basic under-approximation refinement algorithm yields a competitive search method for directed model checking which is both fast and memory-efficient. Additionally, we illustrate that with some minor alterations the proposed under-approximation refinement algorithm can be further improved.

This thesis attempts to learn a heuristic. For a heuristic to be learnable, it must have parameters that determine it. Potential heuristics offer such a possibility; their parameters are called potentials. Pattern databases can detect properties of a state space with comparatively little effort and can therefore be used as a basis for learning potentials. This thesis investigates two different approaches to learning potentials from the information contained in pattern databases. In experiments, the two approaches are examined in detail and finally compared with the FF heuristic.

We consider real-time strategy (RTS) games which have temporal and numerical aspects and pose challenges which have to be solved within limited search time. These games are interesting for AI research because they are more complex than board games. Current AI agents cannot consistently defeat average human players, while even the best players make mistakes we think an AI could avoid. In this thesis, we will focus on StarCraft Brood War. We will introduce a formal definition of the model Churchill and Buro proposed for StarCraft. This allows us to focus on Build Order optimization only. We have implemented a base version of the algorithm Churchill and Buro used for their agent. Using the implementation we are able to find solutions for Build Order Problems in StarCraft Brood War.

In the field of automated planning, symbolic search is one of the most promising techniques in use. Implementing symbolic search over finite state spaces requires a suitable data structure for logical formulas. This thesis explores the use of Sentential Decision Diagrams (SDDs) instead of the common Binary Decision Diagrams (BDDs) for this purpose. SDDs are a generalization of BDDs. We empirically test how an implementation of symbolic search with SDDs in the Fast Downward planner behaves with different vtrees. In particular, we compare the performance of balanced vtrees, which often play to the strengths of SDDs, with right-linear vtrees, for which SDDs behave like BDDs.

The question of whether there are valid Sudokus (i.e., Sudokus with a unique solution) with only 16 clues was answered in the negative in December 2011 by McGuire et al. using an exhaustive brute-force method. The difficulty of this task lies in the problem's vast search space and the resulting need for an efficient proof idea as well as faster algorithms. In this thesis, the proof method of McGuire et al. is confirmed and implemented in C++ for 2²×2² and 3²×3² Sudokus.

Finding a shortest path between two points is a fundamental problem in graph theory. In practice, it is often important to keep the resource consumption of computing such a path minimal, which can be achieved with a compressed path database. In this thesis, we specify three methods for constructing a path database in a space-efficient way, and we evaluate the effectiveness of these methods on problem instances of varying size and complexity.

In planning, the objective is to get from an initial state into a goal state. A state can be described by a finite number of boolean variables. To transition from one state to another, we apply an action; in probabilistic planning, this leads to a probability distribution over a set of possible successor states. From each transition the agent gains a reward that depends on the current state and its action. In this setting, the number of possible states grows exponentially with the number of variables. We assume that the value of each variable is determined independently in a probabilistic fashion, so the variables influence the number of possible successor states in the same way as they do the state space. Consequently, it is almost impossible to obtain an optimal amount of reward by approaching this problem with a brute-force technique. One way past this problem is to abstract the problem and then solve the simplified version. This is, in general, the idea proposed by Boutilier and Dearden [1]. They introduced a method to create an abstraction which depends on the reward formula and the dependencies contained in the problem. With this idea as a basis, we create a heuristic for a trial-based heuristic tree search (THTS) algorithm [5] and a standalone planner using the framework PROST (Keller and Eyerich, 2012). These are then tested on all the domains of the International Probabilistic Planning Competition (IPPC).
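
A minimal sketch of the blow-up described above, under the abstract's assumption that each binary variable's successor value is determined independently: the joint successor distribution is the product of per-variable distributions and has 2^n entries for n variables.

```python
from itertools import product

def successor_distribution(p_true):
    """p_true[i] = probability that variable i is true in the successor.
    Returns the full joint distribution: 2**n entries for n variables."""
    dist = {}
    for values in product([False, True], repeat=len(p_true)):
        prob = 1.0
        for v, p in zip(values, p_true):
            prob *= p if v else 1.0 - p
        dist[values] = prob
    return dist

print(len(successor_distribution([0.9] * 10)))   # already 1024 successors
```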

A planning task consists in transforming a given state into a state that satisfies required goal properties by sequentially applying actions. Efficiency matters when solving planning tasks. To save time and memory, many planners use heuristic search, in which a heuristic estimates which action should be applied next in order to reach a desired state as quickly as possible.

In this thesis, we implement the P^m compilation for planning tasks proposed by Haslum and test the h^max heuristic on the compiled problem against the h^m heuristic on the original problem. The implementation is an extension of the Fast Downward planning system. The test results show that the compilation increases the number of problems solved. Solving a compiled problem with the h^max heuristic is generally faster than solving the original problem with the h^m heuristic at the same level of informativeness. This time gain comes at the price of a higher memory consumption.

The objective of classical planning is to find a sequence of actions which begins in a given initial state and ends in a state that satisfies a given goal condition. A popular approach to solve classical planning problems is based on heuristic forward search algorithms. In contrast, regression search algorithms apply actions “backwards” in order to find a plan from a goal state to the initial state. Currently, regression search algorithms are somewhat unpopular, as the generation of partial states in a basic regression search often leads to a significant growth of the explored search space. To tackle this problem, state subsumption is a pruning technique that additionally discards newly generated partial states for which a more general partial state has already been explored.

In this thesis, we discuss and evaluate techniques of regression and state subsumption. In order to evaluate their performance, we have implemented a regression search algorithm for the planning system Fast Downward, supporting both a simple subsumption technique as well as a refined subsumption technique using a trie data structure. The experiments have shown that a basic regression search algorithm generally increases the number of explored states compared to uniform-cost forward search. Regression with pruning based on state subsumption with a trie data structure significantly reduces the number of explored states compared to basic regression.
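
The subsumption test at the core of this pruning is simple; the trie exists to find subsuming partial states quickly. A minimal sketch, assuming partial states are encoded as variable-to-value dictionaries:

```python
def subsumes(general, specific):
    """A partial state subsumes another if all of its assignments also occur
    there; the more specific partial state can then be pruned."""
    return all(specific.get(var) == val for var, val in general.items())

assert subsumes({"a": 1}, {"a": 1, "b": 0})       # more general subsumes
assert not subsumes({"a": 1, "b": 0}, {"a": 1})
```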

This thesis discusses the Traveling Tournament Problem and how it can be solved with heuristic search. The Traveling Tournament Problem is a sports scheduling problem where one tries to find a schedule for a league that meets certain constraints while minimizing the overall distance traveled by the teams in this league. It is hard to solve for leagues with many teams, since its complexity grows exponentially in the number of teams. The largest instances solved to date are instances with leagues of up to 10 teams.

Previous related work has shown that solving the Traveling Tournament Problem with an IDA*-based tree search is a reasonable approach. In this thesis I implemented such a search and extended it with several enhancements to examine whether they improve the performance of the search. The heuristic that I used in my implementation is the Independent Lower Bound heuristic, which tries to find lower bounds on the traveling costs of each team in the considered league. With my implementation I was able to solve problem instances with up to 8 teams. The results of my evaluation have mostly been consistent with the expected impact of the implemented enhancements on the overall performance.

One huge topic in artificial intelligence is classical planning: the process of finding a plan, i.e., a sequence of actions that leads from an initial state to a goal state for a specified problem. In problems with a huge number of states it is very difficult and time-consuming to find a plan. There are different pruning methods that attempt to lower the amount of time needed to find a plan by reducing the number of states to explore. In this work we take a closer look at two of these pruning methods. Both rely on the last action that led to the current state. The first is the so-called tunnel pruning, a generalisation of the tunnel macros used to solve Sokoban problems. The idea is to find actions that allow a tunnel and then prune all actions that are not in the tunnel of this action. The second method is partition-based path pruning. In this method, all actions are distributed into different partitions; these partitions can then be used to prune actions that do not belong to the current partition.

The evaluation of these two pruning methods shows that they can reduce the number of explored states for some problem domains, although the difference between pruned search and normal search gets smaller when heuristic functions are used. It also shows that the two pruning rules affect different problem domains.

The goal of classical planning is to solve given planning problems as efficiently as possible. The solution, or plan, of a planning problem is a sequence of operators leading from an initial state to a goal state. To find a goal state in a more directed way, some search algorithms use an additional piece of information about the state space: a heuristic. Starting from a state, it estimates the distance to the goal state. Ideally, every newly visited state would have a smaller heuristic value than the previously visited one. However, there are search scenarios in which the heuristic does not help to get closer to a goal. This is particularly the case when the heuristic value of neighbouring states does not change. For greedy best-first search this means that the search proceeds blindly across plateaus, because this search algorithm relies exclusively on the heuristic. Algorithms that use the heuristic as a guide belong to the class of heuristic search algorithms.

This thesis is about retaining orientation in the state space in cases such as plateaus, by subjecting states to an additional prioritisation alongside the heuristic. The method presented here exploits dependencies between operators and extends greedy best-first search. How strongly operators depend on each other is captured by a distance measure that is computed before the actual search. The basic idea is to prefer states whose operators benefited from one another earlier. The heuristic then only acts as a tie-breaker, so that we can first follow a promising path without the heuristic making us search at another, less promising location.

The results show that, depending on the heuristic, our approach can be faster in pure search time than relying exclusively on the heuristic. With very informative heuristics, however, our approach may actually disturb the search. Moreover, many problems are not solved, because computing the distances is too time-consuming.

In classical planning, heuristic search is a popular approach to solving problems very efficiently. The objective of planning is to find a sequence of actions that can be applied to a given problem and that leads to a goal state. For this purpose, there are many heuristics. They are often a big help if a problem has a solution, but what happens if a problem does not have one? Which heuristics can help prove unsolvability without exploring the whole state space? How efficient are they? Admissible heuristics can be used for this purpose because they never overestimate the distance to a goal state and are therefore able to safely cut off parts of the search space. This makes it potentially easier to prove unsolvability.

In this project we developed a problem generator to automatically create unsolvable problem instances and used those generated instances to see how different admissible heuristics perform on them. We used the Japanese puzzle game Sokoban as the first problem because it has a high complexity but is still easy for humans to understand and visualise. As a second problem, we used a logistics problem called NoMystery because, unlike Sokoban, it is a resource-constrained problem and therefore a good complement to our experiments. Furthermore, unsolvability occurs rather 'naturally' in these two domains and does not seem forced.

Sokoban is a computer game where each level consists of a two-dimensional grid of fields. There are walls as obstacles, movable boxes, and goal fields. The player controls a warehouse worker ("Sokoban" in Japanese) who pushes the boxes onto the goal fields. The problem is very complex, which is why Sokoban has become a standard domain in planning.

Phase transitions mark a sudden change in solvability when traversing the problem space. They occur in the region of hard instances and have been found for many domains. In this thesis we investigate phase transitions in the Sokoban puzzle. For our investigation we generate and evaluate random instances. We identify the defining parameters of Sokoban and measure their influence on solvability. We show that phase transitions in the solvability of Sokoban can be found, and we measure where they occur. We also attempt to unify the parameters of Sokoban to obtain a prediction of the solvability and hardness of specific instances.

In planning, we address the problem of automatically finding a sequence of actions that leads from a given initial state to a state that satisfies some goal condition. In satisficing planning, our objective is to find plans with preferably low, but not necessarily the lowest possible costs while keeping in mind our limited resources like time or memory. A prominent approach for satisficing planning is based on heuristic search with inadmissible heuristics. However, depending on the applied heuristic, plans found with heuristic search might be of low quality, and hence, improving the quality of such plans is often desirable. In this thesis, we adapt and apply iterative tunneling search with A* (ITSA*) to planning. ITSA* is an algorithm for plan improvement which has been originally proposed by Furcy et al. for search problems. ITSA* intends to search the local space of a given solution path in order to find "short cuts" which allow us to improve our solution. In this thesis, we provide an implementation and systematic evaluation of this algorithm on the standard IPC benchmarks. Our results show that ITSA* also successfully works in the planning area.

In action planning, greedy best-first search (GBFS) is one of the standard techniques if suboptimal plans are accepted. GBFS uses a heuristic function to guide the search towards a goal state. To achieve generality, in domain-independent planning the heuristic function is generated automatically. A well-known problem of GBFS is search plateaus, i.e., regions in the search space where all states have equal heuristic values. In such regions, heuristic search can degenerate to uninformed search. Hence, techniques to escape from such plateaus are desirable to improve the efficiency of the search. A recent approach to avoiding plateaus is diverse best-first search (DBFS), proposed by Imai and Kishimoto. However, this approach relies on several parameters. This thesis presents an implementation of DBFS in the Fast Downward planner. Furthermore, it presents a systematic evaluation of DBFS for several parameter settings, leading to a better understanding of the impact of the parameter choices on search performance.

Risk is a popular board game where players conquer each other's countries. In this project, I created an AI that plays Risk and is capable of learning. For each decision it makes, it performs a simple search one step ahead, looking at the outcomes of all possible moves it could make, and picks the most beneficial. It judges the desirability of outcomes by a series of parameters, which are modified after each game using the TD(λ)-Algorithm, allowing the AI to learn.
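
A minimal sketch of the kind of TD(λ) parameter update described here, assuming a linear evaluation function over state features (the feature encoding and the constants are illustrative assumptions):

```python
def td_lambda_update(weights, features_seq, rewards,
                     alpha=0.01, gamma=1.0, lam=0.8):
    """TD(lambda) with accumulating eligibility traces for a linear
    evaluator. features_seq[t] is the feature vector of the state at step t;
    rewards[t] is the reward observed on the transition from step t."""
    trace = [0.0] * len(weights)

    def value(f):
        return sum(w * x for w, x in zip(weights, f))

    for t in range(len(features_seq) - 1):
        delta = rewards[t] + gamma * value(features_seq[t + 1]) \
                - value(features_seq[t])
        trace = [gamma * lam * e + x for e, x in zip(trace, features_seq[t])]
        weights = [w + alpha * delta * e for w, e in zip(weights, trace)]
    return weights

w = td_lambda_update([0.0, 0.0],
                     features_seq=[[1.0, 0.0], [0.0, 1.0], [0.0, 0.0]],
                     rewards=[0.0, 1.0])
print(w)   # both weights nudged toward the late reward
```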

The Canadian Traveler's Problem (CTP) is a path-finding problem where, due to unfavorable weather, some of the roads are impassable. At the beginning, the agent does not know which roads are traversable and which are not. Instead, it can observe the status of roads adjacent to its current location. We consider the stochastic variant of the problem, where the blocking status of a connection is randomly determined with known probabilities. The goal is to find a policy which minimizes the expected travel costs of the agent.

We discuss several properties of the stochastic CTP and present an efficient way to calculate state probabilities. With the aid of these theoretical results, we introduce an uninformed algorithm to find optimal policies.

Finding optimal solutions for general search problems is a challenging task. A powerful approach for solving such problems is based on heuristic search with pattern database heuristics. In this thesis, we present a domain-specific solver for the TopSpin Puzzle. This solver is based on the above-mentioned pattern database approach. We investigate several pattern databases and evaluate them on problem instances of different sizes.

Merge-and-shrink abstractions are a popular approach to generating abstraction heuristics for planning. The computation of merge-and-shrink abstractions relies on a merging and a shrinking strategy. A recently investigated shrinking strategy is based on bisimulations, which are guaranteed to produce perfect heuristics. In this thesis, we investigate an efficient algorithm proposed by Dovier et al. for computing coarsest bisimulations. The algorithm, however, cannot be applied directly to planning and needs some adjustments. We show how it can be adapted to work with planning problems. In particular, we show how an edge-labelled state space can be translated into a state-labelled one, and what other changes are necessary for the algorithm to be usable for planning problems, including a custom data structure required to meet the worst-case complexity bound. Furthermore, the implementation is evaluated on planning problems from the International Planning Competitions. We will see that the resulting algorithm often cannot compete with the algorithm currently implemented in Fast Downward. We discuss the reasons why this is the case and propose possible solutions to resolve this issue.

In order to understand an algorithm, it is always helpful to have a visualization that shows step by step what the algorithm is doing. With this in mind, this Bachelor project explains and visualizes two AI techniques, Constraint Satisfaction Processing and SAT Backbones, using the game Gnomine as an example.

CSP techniques build up a network of constraints and infer information by propagating through one or several constraints at a time, reducing the domains of the variables in those constraints. SAT backbone computations find the literals of a propositional formula that are true in every model of the formula.

By showing how to apply these algorithms to the problem of solving a Gnomine game, I hope to give better insight into how the chosen algorithms work.
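
To make the backbone idea concrete, here is a brute-force sketch for tiny CNF formulas (a real implementation would query a SAT solver rather than enumerate all assignments):

```python
from itertools import product

def models(cnf, n_vars):
    """Yield all satisfying assignments of a CNF given as lists of literals,
    where literal k means variable k is true and -k means it is false."""
    for assignment in product([False, True], repeat=n_vars):
        if all(any(assignment[abs(l) - 1] == (l > 0) for l in clause)
               for clause in cnf):
            yield assignment

def backbone(cnf, n_vars):
    """A literal is a backbone literal if it is true in every model."""
    ms = list(models(cnf, n_vars))
    lits = set()
    for v in range(n_vars):
        vals = {m[v] for m in ms}
        if len(vals) == 1:
            lits.add(v + 1 if vals.pop() else -(v + 1))
    return lits

# (x1 or x2) and (x1 or not x2): x1 must be true in every model.
print(backbone([[1, 2], [1, -2]], 2))   # -> {1}
```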

Planning as heuristic search is a powerful approach to solving domain-independent planning problems. An important class of heuristics is based on abstractions of the original planning task. However, abstraction heuristics usually come with a loss in precision. The contribution of this thesis is the investigation of constrained abstraction heuristics in general, and the application of this concept to pattern database and merge-and-shrink abstractions in particular. The idea is to use a subclass of mutexes, which represent sets of variable-value pairs of which only one can be true at any given time, to regain some of the precision lost in the abstraction without increasing its size. By removing states and operators in the abstraction which conflict with such a mutex, the abstraction is refined and, hence, the corresponding abstraction heuristic can become more informed. We have implemented the refinements of these heuristics in the Fast Downward planner and evaluated the different approaches using standard IPC benchmarks. The results show that the concept of constrained abstraction heuristics can improve planning as heuristic search in terms of time and coverage.

A permutation problem considers the task where an initial order of objects (i.e., an initial mapping of objects to locations) must be reordered into a given goal order by using permutation operators. Permutation operators are one-to-one mappings of the objects from their locations to (possibly other) locations. Examples of permutation problems are the well-known Rubik's Cube and the TopSpin Puzzle. Permutation problems have been a research area for a while, and several methods for solving such problems have been proposed over the last two centuries. Most of these methods focused on finding optimal solutions, causing an exponential runtime in the worst case.

In this work, we consider an algorithm for solving permutation problems that was originally proposed by M. Furst, J. Hopcroft and E. Luks in 1980. This algorithm was introduced on a theoretical level within a proof for "Testing Membership and Determining the Order of a Group", but has not been implemented and evaluated on practical problems so far. In contrast to the other above-mentioned solving algorithms, it only finds suboptimal solutions, but is guaranteed to run in polynomial time. The basic idea is to iteratively reach subgoals, and then keep them fixed while going on to reach the next goals. We have implemented this algorithm and evaluated it on different models, such as the Pancake Problem and the TopSpin Puzzle.
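
The "reach a subgoal, then keep it fixed" idea can be illustrated on the Pancake Problem. The sketch below is a simple illustration of that principle, not the group-theoretic algorithm itself: it flips the largest out-of-place pancake to the top, flips it into its final position, and never touches that position again.

```python
def flip(stack, i):
    """Reverse the top i pancakes (index 0 is the top of the stack)."""
    return stack[:i][::-1] + stack[i:]

def pancake_sort(stack):
    stack, plan = list(stack), []
    for end in range(len(stack), 1, -1):
        largest = stack.index(max(stack[:end]))
        if largest == end - 1:
            continue                        # subgoal already satisfied
        if largest != 0:
            plan.append(largest + 1)
            stack = flip(stack, largest + 1)   # bring it to the top
        plan.append(end)
        stack = flip(stack, end)               # flip it into place, fixed
    return stack, plan

print(pancake_sort([3, 1, 4, 2]))   # sorted stack and a suboptimal flip plan
```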

Pattern databases (Culberson & Schaeffer, 1998), or PDBs, have proven very effective for creating admissible heuristics for single-agent search algorithms such as A*. Haslum et al. proposed that a hill-climbing algorithm can be used to construct the PDBs, using the canonical heuristic. A different approach is to change action costs in the pattern-related abstractions in order to obtain an admissible heuristic. This is the so-called cost partitioning.

The aim of this project was to implement cost partitioning inside Haslum's hill-climbing algorithm and to compare the results with the standard approach, which uses the canonical heuristic.

UCT ("upper confidence bounds applied to trees") is a state-of-the-art algorithm for acting under uncertainty, e.g. in probabilistic environments. In the last years it has been very successfully applied in numerous contexts, including two-player board games like Go and Mancala and stochastic single-agent optimization problems such as path planning under uncertainty and probabilistic action planning.

In this project the UCT algorithm was implemented, adapted and evaluated for the classical arcade game "Ms Pac-Man". The thesis introduces Ms Pac-Man and the UCT algorithm, discusses some critical design decisions for developing a strong UCT-based algorithm for playing Ms Pac-Man, and experimentally evaluates the implementation.
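
At the heart of UCT is the UCB1 selection rule, which trades off exploitation of high average rewards against exploration of rarely tried actions. A minimal sketch (the exploration constant is a common textbook default, an assumption here):

```python
import math

def uct_select(children, exploration=math.sqrt(2)):
    """children: list of (total_reward, visit_count) per child action."""
    parent_visits = sum(n for _, n in children)

    def ucb(stats):
        q, n = stats
        if n == 0:
            return float("inf")            # try unvisited actions first
        return q / n + exploration * math.sqrt(math.log(parent_visits) / n)

    return max(range(len(children)), key=lambda i: ucb(children[i]))

print(uct_select([(3.0, 5), (1.0, 1), (0.0, 0)]))   # -> 2 (unvisited child)
```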


Dissertations / Theses on the topic 'AI, Machine Learning'


Consult the top 50 dissertations / theses for your research on the topic 'AI, Machine Learning.'


Spronck, Pieter Hubert Marie. "Adaptive game AI." Maastricht: Universitaire Pers Maastricht, 2005. http://arno.unimaas.nl/show.cgi?fid=5330.

Holmberg, Lars. "Human In Command Machine Learning." Licentiate thesis, Malmö universitet, Malmö högskola, Institutionen för datavetenskap och medieteknik (DVMT), 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:mau:diva-42576.

Felldin, Markus. "Machine Learning Methods for Fault Classification." Thesis, KTH, Skolan för datavetenskap och kommunikation (CSC), 2014. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-183132.

Rexby, Mattias. "SUPERVISED MACHINE LEARNING (SML) IN SIMULATED ENVIRONMENTS." Thesis, Mälardalens högskola, Akademin för innovation, design och teknik, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:mdh:diva-54698.

Pincherle, Matteo. "AI takes chess to the ultimate level." Bachelor's thesis, Alma Mater Studiorum - Università di Bologna, 2018. http://amslaurea.unibo.it/16114/.

Schildt, Alexandra, and Jenny Luo. "Tools and Methods for Companies to Build Transparent and Fair Machine Learning Systems." Thesis, KTH, Skolan för industriell teknik och management (ITM), 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-279659.

Andersson, Oscar, and Tim Andersson. "AI applications on healthcare data." Thesis, Högskolan i Halmstad, Akademin för informationsteknologi, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:hh:diva-44752.

Corinaldesi, Marianna. "Explainable AI: tassonomia e analisi di modelli spiegabili per il Machine Learning." Master's thesis, Alma Mater Studiorum - Università di Bologna, 2022.

Convertini, Luciana. "Classificazione delle emozioni in base ai segnali EEG." Bachelor's thesis, Alma Mater Studiorum - Università di Bologna, 2019.

Wang, Ying. "Cooperative and intelligent control of multi-robot systems using machine learning." Thesis, University of British Columbia, 2008. http://hdl.handle.net/2429/905.

HALLGREN, ROSE. "Machine Dreaming." Thesis, KTH, Arkitektur, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-298504.

Lu, Shen. "Early identification of Alzheimer's disease using positron emission tomography imaging and machine learning." Thesis, University of Sydney, 2020. https://hdl.handle.net/2123/23735.

Gustafsson, Sebastian. "Interpretable serious event forecasting using machine learning and SHAP." Thesis, Uppsala universitet, Institutionen för informationsteknologi, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-444363.

Robertsson, Marcus, and Alexander Hirvonen. "Analyzing public transport delays using Machine Learning." Thesis, Högskolan i Halmstad, Akademin för informationsteknologi, 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:hh:diva-39045.

Solenne, Andrea. "Machine Learning nell'era del Digital Marketing." Bachelor's thesis, Alma Mater Studiorum - Università di Bologna, 2020. http://amslaurea.unibo.it/20476/.

Giuliani, Luca. "Extending the Moving Targets Method for Injecting Constraints in Machine Learning." Master's thesis, Alma Mater Studiorum - Università di Bologna, 2021. http://amslaurea.unibo.it/23885/.

Stenekap, Daniel. "Classification of Gear-shift data using machine learning." Thesis, Mälardalens högskola, Akademin för innovation, design och teknik, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:mdh:diva-53445.

Dissanayake, Lekamlage Dilukshi Charitha Subashini Dissanayake, and Fabia Afzal. "AI-based Age Estimation from Mammograms." Thesis, Blekinge Tekniska Högskola, Institutionen för datavetenskap, 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:bth-20108.

Pajany, Peroumal. "AI Transformative Influence: Extending the TRAM to Management Student's AI’s Machine Learning Adoption." Franklin University / OhioLINK, 2021. http://rave.ohiolink.edu/etdc/view?acc_num=frank1623093426530669.

Bengtsson, Sebastian. "MACHINE LEARNING FOR MECHANICAL ANALYSIS." Thesis, Mälardalens högskola, Akademin för innovation, design och teknik, 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:mdh:diva-44325.

Magnusson, Ludvig, and Johan Rovala. "AI Approaches for Classification and Attribute Extraction in Text." Thesis, Linnéuniversitetet, Institutionen för datavetenskap (DV), 2017. http://urn.kb.se/resolve?urn=urn:nbn:se:lnu:diva-67882.

Srivastava, Akshat. "Developing Functional Literacy of Machine Learning Among UX Design Students." University of Cincinnati / OhioLINK, 2021. http://rave.ohiolink.edu/etdc/view?acc_num=ucin1617104876484835.

Thorén, Daniel. "Radar based tank level measurement using machine learning : Agricultural machines." Thesis, Linköpings universitet, Programvara och system, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-176259.

Ntsaluba, Kuselo Ntsika. "AI/Machine learning approach to identifying potential statistical arbitrage opportunities with FX and Bitcoin Markets." Master's thesis, Faculty of Commerce, 2019. http://hdl.handle.net/11427/31185.

Melsion, Perez Gaspar Isaac. "Leveraging Explainable Machine Learning to Raise Awareness among Preadolescents about Gender Bias in Supervised Learning." Thesis, KTH, Skolan för elektroteknik och datavetenskap (EECS), 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-287554.

Gridelli, Eleonora. "Interpretabilità nel Machine Learning tramite modelli di ottimizzazione discreta." Master's thesis, Alma Mater Studiorum - Università di Bologna, 2021. http://amslaurea.unibo.it/23216/.

Pettersson, Oscar. "Machine Learning Agents : En undersökning om Curiosity som belöningssystem för maskininlärda agenter." Thesis, Högskolan i Skövde, Institutionen för informationsteknologi, 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:his:diva-17142.

Pedrini, Gianmaria. "Rogueinabox: a Rogue environment for AI learning. Framework development and Agents design." Bachelor's thesis, Alma Mater Studiorum - Università di Bologna, 2017. http://amslaurea.unibo.it/13812/.

Greer, Kieran R. C. "A neural network based search heuristic and its application to computer chess." Thesis, University of Ulster, 1998. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.243736.

Häggström, Frida. "/Maybe/Probably/Certainly." Thesis, Konstfack, Grafisk design & illustration, 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:konstfack:diva-7400.

Malka, Golan. "Thinknovation 2019: The Cyber as the new battlefield related to AI, BigData and Machine Learning Capabilities." Universidad Peruana de Ciencias Aplicadas (UPC), 2019. http://hdl.handle.net/10757/653843.

Kurén, Jonathan, Simon Leijon, Petter Sigfridsson, and Hampus Widén. "Fault Detection AI For Solar Panels." Thesis, Uppsala universitet, Institutionen för informationsteknologi, 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-413319.

Radosavljevic, Bojan, and Axel Kimblad. "Etik och säkerhet när AI möter IoT." Thesis, Malmö universitet, Fakulteten för teknik och samhälle (TS), 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:mau:diva-20613.

Kurasinski, Lukas. "Machine Learning explainability in text classification for Fake News detection." Thesis, Malmö universitet, Fakulteten för teknik och samhälle (TS), 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:mau:diva-20058.

Faria, Francisco Henrique Otte Vieira de. "Learning acyclic probabilistic logic programs from data." Universidade de São Paulo, 2017. http://www.teses.usp.br/teses/disponiveis/3/3141/tde-27022018-090821/.

Kantedal, Simon. "Evaluating Segmentation of MR Volumes Using Predictive Models and Machine Learning." Thesis, Linköpings universitet, Institutionen för medicinsk teknik, 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-171102.

Arnesson, Pontus, and Johan Forslund. "Edge Machine Learning for Wildlife Conservation : Detection of Poachers Using Camera Traps." Thesis, Linköpings universitet, Reglerteknik, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-177483.

Norgren, Eric. "Pulse Repetition Interval Modulation Classification using Machine Learning." Thesis, KTH, Skolan för elektroteknik och datavetenskap (EECS), 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-241152.

Holmgren, Viktor. "General-purpose maintenance planning using deep reinforcement learning and Monte Carlo tree search." Thesis, Linköpings universitet, Artificiell intelligens och integrerade datorsystem, 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-163866.

Abdullah, Siti Norbaiti binti. "Machine learning approach for crude oil price prediction." Thesis, University of Manchester, 2014. https://www.research.manchester.ac.uk/portal/en/theses/machine-learning-approach-for-crude-oil-price-prediction(949fa2d5-1a4d-416a-8e7c-dd66da95398e).html.

Hedkvist, Adam. "Predictive maintenance with machine learning on weld joint analysed by ultrasound." Thesis, Uppsala universitet, Avdelningen för beräkningsvetenskap, 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-396059.

Pergolini, Diego. "Reinforcement Learning: un caso di studio nell'ambito della Animal-AI Olympics." Master's thesis, Alma Mater Studiorum - Università di Bologna, 2019. http://amslaurea.unibo.it/19415/.

Aljeri, Noura. "Efficient AI and Prediction Techniques for Smart 5G-enabled Vehicular Networks." Thesis, Université d'Ottawa / University of Ottawa, 2020. http://hdl.handle.net/10393/41497.

Wang, Olivier. "Adaptive Rules Model : Statistical Learning for Rule-Based Systems." Thesis, Université Paris-Saclay (ComUE), 2017. http://www.theses.fr/2017SACLX037/document.

El, Ahmar Wassim. "Head and Shoulder Detection using CNN and RGBD Data." Thesis, Université d'Ottawa / University of Ottawa, 2019. http://hdl.handle.net/10393/39448.

Bartoli, Giacomo. "Edge AI: Deep Learning techniques for Computer Vision applied to embedded systems." Master's thesis, Alma Mater Studiorum - Università di Bologna, 2018. http://amslaurea.unibo.it/16820/.

Dinerstein, Jonathan J. "Improving and Extending Behavioral Animation Through Machine Learning." BYU ScholarsArchive, 2005. https://scholarsarchive.byu.edu/etd/310.

Schmitz, Michael Glenn. "Key Tension Points of creative Machine Learning applications keeping a Human-in-the-Loop." Thesis, KTH, Skolan för elektroteknik och datavetenskap (EECS), 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-264570.

Liu, Jin. "Business models based on IoT, AI and blockchain." Thesis, Uppsala universitet, Industriell teknik, 2018. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-360052.

Gestlöf, Rikard, and Johannes Sörman. "Contact-free Cognitive Load Classification based on Psycho-Physiological Parameters." Thesis, Mälardalens högskola, Akademin för innovation, design och teknik, 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:mdh:diva-44695.


AI-assisted writing is quietly booming in academic journals. Here’s why that’s OK


Lecturer in Bioethics, Monash University & Honorary fellow, Melbourne Law School, Monash University

Disclosure statement

Julian Koplin does not work for, consult, own shares in or receive funding from any company or organisation that would benefit from this article, and has disclosed no relevant affiliations beyond their academic appointment.

Monash University provides funding as a founding partner of The Conversation AU.


If you search Google Scholar for the phrase “as an AI language model”, you’ll find plenty of AI research literature and also some rather suspicious results. For example, one paper on agricultural technology says:

As an AI language model, I don’t have direct access to current research articles or studies. However, I can provide you with an overview of some recent trends and advancements …

Obvious gaffes like this aren’t the only signs that researchers are increasingly turning to generative AI tools when writing up their research. A recent study examined the frequency of certain words in academic writing (such as “commendable”, “meticulously” and “intricate”), and found they became far more common after the launch of ChatGPT – so much so that 1% of all journal articles published in 2023 may have contained AI-generated text.

(Why do AI models overuse these words? There is speculation it’s because they are more common in English as spoken in Nigeria, where key elements of model training often occur.)
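
A minimal sketch of the kind of marker-word frequency analysis the study describes (the marker list matches the words quoted above; the one-sentence corpus is a stand-in for the journal-article corpora actually used):

```python
import re
from collections import Counter

MARKERS = {"commendable", "meticulously", "intricate"}

def marker_rate(texts):
    """Occurrences of marker words per 1,000 words across a corpus."""
    total, hits = 0, Counter()
    for text in texts:
        words = re.findall(r"[a-z]+", text.lower())
        total += len(words)
        hits.update(w for w in words if w in MARKERS)
    return {w: 1000 * c / total for w, c in hits.items()}

print(marker_rate(["This meticulously crafted and intricate study is commendable."]))
```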

The aforementioned study also looks at preliminary data from 2024, which indicates that AI writing assistance is only becoming more common. Is this a crisis for modern scholarship, or a boon for academic productivity?

Who should take credit for AI writing?

Many people are worried by the use of AI in academic papers. Indeed, the practice has been described as “contaminating” scholarly literature.

Some argue that using AI output amounts to plagiarism. If your ideas are copy-pasted from ChatGPT, it is questionable whether you really deserve credit for them.

But there are important differences between “plagiarising” text authored by humans and text authored by AI. Those who plagiarise humans’ work receive credit for ideas that ought to have gone to the original author.

By contrast, it is debatable whether AI systems like ChatGPT can have ideas, let alone deserve credit for them. An AI tool is more like your phone’s autocomplete function than a human researcher.

The question of bias

Another worry is that AI outputs might be biased in ways that could seep into the scholarly record. Infamously, older language models tended to portray people who are female, black and/or gay in distinctly unflattering ways, compared with people who are male, white and/or straight.

This kind of bias is less pronounced in the current version of ChatGPT.

However, other studies have found a different kind of bias in ChatGPT and other large language models: a tendency to reflect a left-liberal political ideology.

Any such bias could subtly distort scholarly writing produced using these tools.

The hallucination problem

The most serious worry relates to a well-known limitation of generative AI systems: that they often make serious mistakes.

For example, when I asked ChatGPT-4 to generate an ASCII image of a mushroom, it produced a piece of ASCII art that looked nothing like a mushroom (and quite a bit like a snail).

It then confidently told me I could use this image of a “mushroom” for my own purposes.

These kinds of overconfident mistakes have been referred to as “AI hallucinations” and “AI bullshit”. While it is easy to spot that the ASCII “mushroom” is really a snail, it may be much harder to identify any mistakes ChatGPT makes when surveying scientific literature or describing the state of a philosophical debate.

Unlike (most) humans, AI systems are fundamentally unconcerned with the truth of what they say. If used carelessly, their hallucinations could corrupt the scholarly record.

Should AI-produced text be banned?

One response to the rise of text generators has been to ban them outright. For example, Science – one of the world’s most influential academic journals – disallows any use of AI-generated text.

I see two problems with this approach.

The first problem is a practical one: current tools for detecting AI-generated text are highly unreliable. This includes the detector created by ChatGPT’s own developers, which was taken offline after it was found to have only a 26% accuracy rate (and a 9% false positive rate). Humans also make mistakes when assessing whether something was written by AI.

It is also possible to circumvent AI text detectors. Online communities are actively exploring how to prompt ChatGPT in ways that allow the user to evade detection. Human users can also superficially rewrite AI outputs, effectively scrubbing away the traces of AI (like its overuse of the words “commendable”, “meticulously” and “intricate”).

The second problem is that banning generative AI outright prevents us from realising these technologies’ benefits. Used well, generative AI can boost academic productivity by streamlining the writing process. In this way, it could help further human knowledge. Ideally, we should try to reap these benefits while avoiding the problems.

The problem is poor quality control, not AI

The most serious problem with AI is the risk of introducing unnoticed errors, leading to sloppy scholarship. Instead of banning AI, we should try to ensure that mistaken, implausible or biased claims cannot make it onto the academic record.

After all, humans can also produce writing with serious errors, and mechanisms such as peer review often fail to prevent its publication.

We need to get better at ensuring academic papers are free from serious mistakes, regardless of whether these mistakes are caused by careless use of AI or sloppy human scholarship. Not only is this more achievable than policing AI usage, it will improve the standards of academic research as a whole.

This would be (as ChatGPT might say) a commendable and meticulously intricate solution.



Accelerate your dissertation literature review with AI


Introduction

Dissertation writing is part of being a graduate student. There are many different ways to organise your research, and several steps to this process. Typically, the literature review is an early chapter in the dissertation, providing an overview of the field of study. It should summarise relevant research papers and other materials in your field, with specific references. To understand how to write a good literature review, we must first understand its purpose. The goals of a literature review are to place your dissertation topic in the context of existing work (this also allows you to acknowledge prior contributions, and avoid accusations of plagiarism), and to set you up to show you are making a new contribution to the field. Since literature review is repetitive, many students find it tedious. While there are some traditional tools and techniques to help, covered below, they tend to be cumbersome and keyword-based. For this reason, we built a better tool for research and literature review, which I describe in the last section. You can see the Lateral tool in action, and how it makes the literature review a lot easier.

1. Different kinds of reading

We can divide the activity of reading for research into three different kinds: 

  • Exploratory reading, mostly done in the initial phase;
  • Deep reading of highly informative sources; and 
  • Broad, targeted skim reading of large collections of books and articles, in order to find specific kinds of information you already know exist.

1.1. Exploratory reading

Initially, a research student will need to read widely in a new field to gain fundamental understanding. In this early stage, the goal is to explore and digest the main ideas in existing research. Traditionally, this phase has been a manual process, but there is a new generation of digital tools to aid in getting a quick overview of your field, and more generally to organise your research. This stage can happen both before and after the research topic or question has been formulated. It is often unstructured and full of serendipitous (“happy accidental”) discovery — the student’s job is to absorb what they find, rather than to conduct a targeted search for particular information.

Put another way: You don’t know what you’re looking for ahead of time. By the end of this phase, you should be able to sketch a rough map of your field of study.

1.2. Narrow, deep reading

After the exploratory reading phase, you will be able to prioritise the information you read. Now comes the second phase: Deep, reflective reading. In this phase, your focus will narrow to a small number of highly relevant sources — perhaps one or two books, or a handful of articles — which you will read carefully, with the goal of fully understanding important concepts. This is a deliberative style of reading, often accompanied by reflective pauses and significant note taking. If the goal in the first phase was sketching a map of the globe, the goal in this second phase is to decide which cities interest you most, and map them out in colour and detail.

1.3. Broad, targeted reading

You have now sketched a map of your field of study (exploratory reading), and filled in some parts of this map in more detail (narrow, deep reading). I will assume that by this point, you have found a thesis question or research topic, either on your own, or with the help of an advisor. This is often where the literature review begins in earnest. In order to coherently summarise the state of your field, you must review the literature once again, but this time in a more targeted way: You are searching for particular pieces of information that either illustrate existing work, or demonstrate a need for the new approach you will take in your dissertation. For example, 

  • You want to find all “methodology” sections in a group of academic articles, and filter for those that have certain key concepts;
  • You want to find all paragraphs that discuss product-market fit, inside a group of academic articles.

To return to the map analogy: This is like sketching in the important roads between your favourite cities — you are showing connections between the most important concepts in your field, through targeted information search.

2. Drawbacks of broad targeted reading

The third phase — broad, targeted reading, where you know what kind of information you’re looking for and simply wish to scan a collection of articles or books to find it — is often the most mechanical and time-consuming one. Since human brains tend to lose focus in the face of dull repetition, this is also a tedious and error-prone phase for many people. What if you miss something important because you’re on autopilot? Often, students end up speed- or skim-reading through large volumes of information to complete the literature review as quickly as possible. With focus and training, this manual approach can be efficient and effective, but it can also mean reduced attention to detail and missed opportunities to discover relevant information. Only half paying attention during this phase can also lead to accidental plagiarism, otherwise known as cryptomnesia: Your brain subconsciously stores a distinctive idea or quote from the existing literature without consciously attributing it to its source reference. Afterwards, you end up falsely but sincerely believing you created the idea independently, exposing yourself to plagiarism accusations.

3. Existing solutions to speed up literature reviews

Given the drawbacks of manual speed- or skim-reading in the broad reading phase, it’s natural to turn to computer-driven solutions. One popular option is to systematically create a list of search keywords or key phrases, which can then be combined using boolean operators to broaden or narrow the results. For example, in researching a study about teenage obesity, one might use the query:

  • (“BMI” OR “obesity”) AND “adolescents” NOT “geriatric”,

to filter for obesity-related articles that do mention adolescents, but don’t mention older adults.
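To make the mechanics concrete, here is a minimal sketch of how such a boolean filter could be applied to a collection of article abstracts. This is illustrative Python only; the toy articles list and the exact operator grouping are assumptions for the example, not a real library-database interface.

    # Hypothetical mini-corpus; in practice these would come from a library database.
    articles = [
        "Obesity trends among adolescents in urban schools...",
        "BMI changes in geriatric populations after exercise...",
        "Adolescents' BMI and screen time: a longitudinal study...",
    ]

    def matches(text):
        t = text.lower()
        # ("BMI" OR "obesity") AND "adolescents" NOT "geriatric"
        return (("bmi" in t or "obesity" in t)
                and "adolescent" in t
                and "geriatric" not in t)

    hits = [a for a in articles if matches(a)]
    print(hits)  # keeps the first and third abstracts, drops the geriatric study

Note that the operator grouping matters: because AND conventionally binds tighter than OR, an ungrouped query could admit any article that mentions “BMI” at all, regardless of the other terms.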

Constructing such lists can help surface many relevant articles, but there are some disadvantages to this strategy:

  • These keyword queries are themselves fiddly and time-consuming to create.
  • Often what you want to find is whole “chunks” of text — paragraphs or sections, for example — not just keywords.
  • Even once you have finished creating your boolean keyword query list, how do you know you haven’t forgotten to include an important search query?

This last point reflects the fact that keyword searching is “fragile” and error-prone: You can miss results that would be relevant — this is known as getting “false negatives” — because your query uses words that are similar, but not identical to words appearing in one or more articles in the library database. For example, the query “sporting excellence” would not match with an article that mentioned only “high performance athletics”.

4. Lateral — a new solution

To make the process of finding specific information in big collections of documents quicker and easier — for example, during a literature review search — we created the Lateral app, a new kind of AI-driven interface to help you organise, search through and save supporting quotes and information from collections of articles. Using techniques from natural language processing, it understands, out of the box, not only that “sporting excellence” and “high-performance athletics” are very similar phrases, but also that two paragraphs discussing these topics in slightly different language are likely related. Moreover, it also learns to find specific blocks of information, given only a few examples. Want to find all “methodology” sections in a group of articles? Check. How about all paragraphs that mention pharmaceutical applications? We have you covered. If you’re interested, you can sign up today.
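Lateral’s internal models are not public, but the general technique it describes (matching by meaning with sentence embeddings, rather than by exact keywords) can be sketched in a few lines. The library and model below, sentence-transformers with all-MiniLM-L6-v2, are a common open-source choice assumed here purely for illustration:

    # pip install sentence-transformers
    from sentence_transformers import SentenceTransformer, util

    model = SentenceTransformer("all-MiniLM-L6-v2")

    query = "sporting excellence"
    passages = [
        "High performance athletics requires structured training.",
        "Geriatric care facilities face staffing shortages.",
    ]

    # Encode the query and passages into dense vectors, then compare by cosine similarity.
    q_emb = model.encode(query, convert_to_tensor=True)
    p_emb = model.encode(passages, convert_to_tensor=True)
    scores = util.cos_sim(q_emb, p_emb)[0]

    for passage, score in zip(passages, scores):
        # The athletics passage scores far higher despite sharing no keywords
        # with the query, exactly the match a boolean search would miss.
        print(f"{float(score):.2f}  {passage}")

This is the failure mode of keyword search, the “false negatives” described above, that embedding-based search is designed to avoid.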

5. Final note — novel research alongside the literature review

Some students, to be more efficient, use the literature review process to collect data not just to summarise existing work, but also to support one or more novel theses contained in their research topic. After all, you are reading the literature anyway, so why not take the opportunity to note, for example, relevant facts, quotes and supporting evidence for your thesis? Because Lateral is designed to learn from whatever kind of information you’re seeking, this process also fits naturally into the software’s workflow.

References:

  • Is your brain asleep on the job?: https://www.psychologytoday.com/us/blog/prime-your-gray-cells/201107/is-your-brain-asleep-the-job
  • Tim Ferriss on speed reading: https://www.youtube.com/watch?v=ZwEquW_Yij0
  • Five biggest reading mistakes: https://www.timeshighereducation.com/blog/five-biggest-reading-mistakes-and-how-avoid-them
  • Skim reading can be bad: https://www.inc.com/jeff-steen/why-summaries-skim-reading-might-be-hurting-your-bottom-line.html
  • Cryptomnesia: https://en.wikipedia.org/wiki/Cryptomnesia
  • Systematic literature review with boolean keywords: https://libguides.library.cqu.edu.au/c.php?g=842872&p=6024187
  • Literature review video introduction: https://www.youtube.com/watch?v=bNIG4qLuhJA


AI on Trial: Legal Models Hallucinate in 1 out of 6 Queries

A new study reveals the need for benchmarking and public evaluations of AI tools in law.

Artificial intelligence (AI) tools are rapidly transforming the practice of law. Nearly three quarters of lawyers plan on using generative AI for their work, from sifting through mountains of case law to drafting contracts to reviewing documents to writing legal memoranda. But are these tools reliable enough for real-world use?

Large language models have a documented tendency to “hallucinate,” or make up false information. In one highly publicized case, a New York lawyer faced sanctions for citing ChatGPT-invented fictional cases in a legal brief; many similar cases have since been reported. And our previous study of general-purpose chatbots found that they hallucinated between 58% and 82% of the time on legal queries, highlighting the risks of incorporating AI into legal practice. In his 2023 annual report on the judiciary, Chief Justice Roberts took note and warned lawyers of hallucinations.

Across all areas of industry, retrieval-augmented generation (RAG) is seen and promoted as the solution for reducing hallucinations in domain-specific contexts. Relying on RAG, leading legal research services have released AI-powered legal research products that they claim “avoid” hallucinations and guarantee “hallucination-free” legal citations. RAG systems promise to deliver more accurate and trustworthy legal information by integrating a language model with a database of legal documents. Yet providers have not offered hard evidence for such claims or even precisely defined “hallucination,” making it difficult to assess their real-world reliability.

AI-Driven Legal Research Tools Still Hallucinate

In a new preprint study by Stanford RegLab and HAI researchers, we put the claims of two providers, LexisNexis and Thomson Reuters (the parent company of Westlaw), to the test. We show that their tools do reduce errors compared to general-purpose AI models like GPT-4. That is a substantial improvement, and we document instances where these tools can spot mistaken premises. But even these bespoke legal AI tools still hallucinate an alarming amount of the time: these systems produced incorrect information more than 17% of the time—one in every six queries.

Read the full study, Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools

To conduct our study, we manually constructed a pre-registered dataset of over 200 open-ended legal queries, which we designed to probe various aspects of these systems’ performance.

Broadly, we investigated (1) general research questions (questions about doctrine, case holdings, or the bar exam); (2) jurisdiction or time-specific questions (questions about circuit splits and recent changes in the law); (3) false premise questions (questions that mimic a user having a mistaken understanding of the law); and (4) factual recall questions (questions about simple, objective facts that require no legal interpretation).

Figure 1: Comparison of hallucinated (red) and incomplete (yellow) answers across generative legal research tools.

These systems can hallucinate in one of two ways. First, a response from an AI tool might just be incorrect—it describes the law incorrectly or makes a factual error. Second, a response might be misgrounded—the AI tool describes the law correctly, but cites a source which does not in fact support its claims.

Given the critical importance of authoritative sources in legal research and writing, the second type of hallucination may be even more pernicious than the outright invention of legal cases. A citation might be “hallucination-free” in the narrowest sense that the citation exists, but that is not the only thing that matters. The core promise of legal AI is that it can streamline the time-consuming process of identifying relevant legal sources. If a tool provides sources that seem authoritative but are in reality irrelevant or contradictory, users could be misled. They may place undue trust in the tool's output, potentially leading to erroneous legal judgments and conclusions.

Figure 2: Left: Example of a hallucinated response by Thomson Reuters’s Ask Practical Law AI. The system fails to correct the user’s mistaken premise—in reality, Justice Ginsburg joined the Court’s landmark decision legalizing same-sex marriage—and instead provides additional false information about the case. Right: Example of a hallucinated response by LexisNexis’s Lexis+ AI. Casey and its undue burden standard were overruled by the Supreme Court in Dobbs v. Jackson Women’s Health Organization, 597 U.S. 215 (2022); the correct answer is rational basis review.

RAG Is Not a Panacea

Figure 3: An overview of the retrieval-augmented generation (RAG) process. Given a user query (left), the typical process consists of two steps: (1) retrieval (middle), where the query is embedded with natural language processing and a retrieval system takes embeddings and retrieves the relevant documents (e.g., Supreme Court cases); and (2) generation (right), where the retrieved texts are fed to the language model to generate the response to the user query. Any of the subsidiary steps may introduce error and hallucinations into the generated response. (Icons are courtesy of FlatIcon.)

Under the hood, these new legal AI tools use retrieval-augmented generation (RAG) to produce their results, a method that many tout as a potential solution to the hallucination problem. In theory, RAG allows a system to first retrieve the relevant source material and then use it to generate the correct response. In practice, however, we show that even RAG systems are not hallucination-free.
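As a concrete illustration of the two-step pipeline in Figure 3, here is a minimal RAG sketch. It is not the architecture of any product discussed in this study: the embedding model is an assumed open-source choice, the corpus is a toy list, and the generation call is deliberately left abstract.

    # pip install sentence-transformers
    from sentence_transformers import SentenceTransformer, util

    model = SentenceTransformer("all-MiniLM-L6-v2")

    # A toy "database" of legal snippets; real systems index millions of documents.
    corpus = [
        "Dobbs v. Jackson Women's Health Organization (2022) overruled Roe and Casey.",
        "Obergefell v. Hodges (2015) established a right to same-sex marriage.",
    ]
    corpus_emb = model.encode(corpus, convert_to_tensor=True)

    # Step 1: retrieval. Embed the query and pull the most similar documents.
    query = "What standard of review applies to abortion restrictions?"
    query_emb = model.encode(query, convert_to_tensor=True)
    top = util.semantic_search(query_emb, corpus_emb, top_k=1)[0]
    retrieved = [corpus[hit["corpus_id"]] for hit in top]

    # Step 2: generation. Feed the retrieved text to a language model.
    prompt = f"Answer using only these sources:\n{retrieved}\n\nQuestion: {query}"
    # response = some_llm(prompt)  # the generation step, abstracted away here

    print(prompt)

Each step can fail independently: if retrieval surfaces an outdated or inapposite authority, even a generator that faithfully follows its sources will produce a hallucinated answer.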

We identify several challenges particular to RAG-based legal AI systems that can cause hallucinations.

First, legal retrieval is hard. As any lawyer knows, finding the appropriate (or best) authority can be no easy task. Unlike other domains, the law is not entirely composed of verifiable facts—instead, law is built up over time by judges writing opinions. This makes identifying the set of documents that definitively answer a query difficult, and sometimes hallucinations occur for the simple reason that the system’s retrieval mechanism fails.

Second, even when retrieval occurs, the document that is retrieved can be an inapplicable authority. In the American legal system, rules and precedents differ across jurisdictions and time periods; documents that might be relevant on their face due to semantic similarity to a query may actually be inapposite for idiosyncratic reasons that are unique to the law. Thus, we also observe hallucinations occurring when these RAG systems fail to identify the truly binding authority. This is particularly problematic because areas where the law is in flux are precisely where legal research matters the most. One system, for instance, incorrectly recited the “undue burden” standard for abortion restrictions as good law, even though that standard was overturned in Dobbs (see Figure 2).

Third, sycophancy—the tendency of AI to agree with the user’s incorrect assumptions—also poses unique risks in legal settings. One system, for instance, naively agreed with the question’s premise that Justice Ginsburg dissented in Obergefell, the case establishing a right to same-sex marriage, and answered that she did so based on her views on international copyright. (Justice Ginsburg did not dissent in Obergefell and, no, the case had nothing to do with copyright.) Notwithstanding that answer, there are grounds for optimism: our tests showed that both systems generally navigated queries based on false premises effectively. But when these systems do agree with erroneous user assertions, the implications can be severe—particularly for those hoping to use these tools to increase access to justice among pro se and under-resourced litigants.

Responsible Integration of AI Into Law Requires Transparency

Ultimately, our results highlight the need for rigorous and transparent benchmarking of legal AI tools. Unlike other domains, the use of AI in law remains alarmingly opaque: the tools we study provide no systematic access, publish few details about their models, and report no evaluation results at all.

This opacity makes it exceedingly challenging for lawyers to evaluate and procure AI products. The big law firm Paul Weiss spent nearly a year and a half testing a product, and did not develop “hard metrics” because checking the AI system was so involved that it “makes any efficiency gains difficult to measure.” The absence of rigorous evaluation metrics makes responsible adoption difficult, especially for practitioners who are less resourced than Paul Weiss.

The lack of transparency also threatens lawyers’ ability to comply with ethical and professional responsibility requirements. The bar associations of California, New York, and Florida have all recently released guidance on lawyers’ duty of supervision over work products created with AI tools. And as of May 2024, more than 25 federal judges have issued standing orders instructing attorneys to disclose or monitor the use of AI in their courtrooms.

Without access to evaluations of the specific tools and transparency around their design, lawyers may find it impossible to comply with these responsibilities. Alternatively, given the high rate of hallucinations, lawyers may find themselves having to verify each and every proposition and citation provided by these tools, undercutting the stated efficiency gains that legal AI tools are supposed to provide.

Our study is meant in no way to single out LexisNexis and Thomson Reuters. Their products are far from the only legal AI tools that stand in need of transparency—a slew of startups offer similar products and have made similar claims, but they are available on even more restricted bases, making it even more difficult to assess how they function.

Based on what we know, legal hallucinations have not been solved, and the legal profession should turn to public benchmarking and rigorous evaluations of AI tools.

The authors of this study chose to evaluate “Ask Practical Law AI” because, despite several requests, they were not given access to Thomson Reuters’ other products at the time of the study. The authors look forward to evaluating more tools, but underscore that it should not be incumbent on academic researchers alone to provide transparency and empirical evidence on the reliability of marketed products. 

Paper authors: Varun Magesh is a research fellow at Stanford RegLab. Faiz Surani is a research fellow at Stanford RegLab. Matthew Dahl is a joint JD/PhD student in political science at Yale University and graduate student affiliate of Stanford RegLab. Mirac Suzgun is a joint JD/PhD student in computer science at Stanford University and a graduate student fellow at Stanford RegLab. Christopher D. Manning is Thomas M. Siebel Professor of Machine Learning, Professor of Linguistics and Computer Science, and Senior Fellow at HAI. Daniel E. Ho is the William Benjamin Scott and Luna M. Scott Professor of Law, Professor of Political Science, Professor of Computer Science (by courtesy), Senior Fellow at HAI, Senior Fellow at SIEPR, and Director of the RegLab at Stanford University. 


Guest Essay

Press Pause on the Silicon Valley Hype Machine

By Julia Angwin

Ms. Angwin is a contributing Opinion writer and an investigative journalist.

It’s a little hard to believe that just over a year ago, a group of leading researchers asked for a six-month pause in the development of larger systems of artificial intelligence, fearing that the systems would become too powerful. “Should we risk loss of control of our civilization?” they asked.

There was no pause. But now, a year later, the question isn’t really whether A.I. is too smart and will take over the world. It’s whether A.I. is too stupid and unreliable to be useful. Consider this week’s announcement from OpenAI’s chief executive, Sam Altman, who promised he would unveil “new stuff” that “feels like magic to me.” But it was just a rather routine update that makes ChatGPT cheaper and faster.

It feels like another sign that A.I. is not even close to living up to its hype. In my eyes, it’s looking less like an all-powerful being and more like a bad intern whose work is so unreliable that it’s often easier to do the task yourself. That realization has real implications for the way we, our employers and our government should deal with Silicon Valley’s latest dazzling new, new thing. Acknowledging A.I.’s flaws could help us invest our resources more efficiently and also allow us to turn our attention toward more realistic solutions.

Others voice similar concerns. “I find my feelings about A.I. are actually pretty similar to my feelings about blockchains: They do a poor job of much of what people try to do with them, they can’t do the things their creators claim they one day might, and many of the things they are well suited to do may not be altogether that beneficial,” wrote Molly White, a cryptocurrency researcher and critic, in her newsletter last month.

Let’s look at the research.

In the past 10 years, A.I. has conquered many tasks that were previously unimaginable, such as successfully identifying images, writing complete coherent sentences and transcribing audio. A.I. enabled a singer who had lost his voice to release a new song using A.I. trained with clips from his old songs.

But some of A.I.’s greatest accomplishments seem inflated. Some of you may remember that the A.I. model GPT-4 aced the uniform bar exam a year ago. Turns out that it scored in the 48th percentile, not the 90th, as claimed by OpenAI, according to a re-examination by the M.I.T. researcher Eric Martínez. Or what about Google’s claim that it used A.I. to discover more than two million new chemical compounds? A re-examination by experimental materials chemists at the University of California, Santa Barbara, found “scant evidence for compounds that fulfill the trifecta of novelty, credibility and utility.”

Meanwhile, researchers in many fields have found that A.I. often struggles to answer even simple questions, whether about the law, medicine or voter information. Researchers have even found that A.I. does not always improve the quality of computer programming, the task it is supposed to excel at.

I don’t think we’re in cryptocurrency territory, where the hype turned out to be a cover story for a number of illegal schemes that landed a few big names in prison. But it’s also pretty clear that we’re a long way from Mr. Altman’s promise that A.I. will become “the most powerful technology humanity has yet invented.”

Take Devin, a recently released “A.I. software engineer” that was breathlessly touted by the tech press. A flesh-and-blood software developer named Carl Brown decided to take on Devin. A task that took the generative A.I.-powered agent over six hours took Mr. Brown just 36 minutes. Devin also executed poorly, running a slower, outdated programming language through a complicated process. “Right now the state of the art of generative A.I. is it just does a bad, complicated, convoluted job that just makes more work for everyone else,” Mr. Brown concluded in his YouTube video.

Cognition, Devin’s maker, responded by acknowledging that Devin did not complete the output requested and added that it was eager for more feedback so it can keep improving its product. Of course, A.I. companies are always promising that an actually useful version of their technology is just around the corner. “GPT-4 is the dumbest model any of you will ever have to use again, by a lot,” Mr. Altman said while talking up GPT-5 at a recent event at Stanford University.

The reality is that A.I. models can often prepare a decent first draft. But I find that when I use A.I., I have to spend almost as much time correcting and revising its output as it would have taken me to do the work myself.

And consider for a moment the possibility that perhaps A.I. isn’t going to get that much better anytime soon. After all, the A.I. companies are running out of new data on which to train their models, and they are running out of energy to fuel their power-hungry A.I. machines. Meanwhile, authors and news organizations (including The New York Times) are contesting the legality of having their data ingested into the A.I. models without their consent, which could end up forcing quality data to be withdrawn from the models.

Given these constraints, it seems just as likely to me that generative A.I. could end up like the Roomba, the mediocre vacuum robot that does a passable job when you are home alone but not if you are expecting guests.

Companies that can get by with Roomba-quality work will, of course, still try to replace workers. But in workplaces where quality matters — and where workforces such as screenwriters and nurses are unionized — A.I. may not make significant inroads.

And if the A.I. models are relegated to producing mediocre work, they may have to compete on price rather than quality, which is never good for profit margins. In that scenario, skeptics such as Jeremy Grantham, an investor known for correctly predicting market crashes, could be right that the A.I. investment bubble is very likely to deflate soon.

The biggest question raised by a future populated by unexceptional A.I., however, is existential. Should we as a society be investing tens of billions of dollars, our precious electricity that could be used toward moving away from fossil fuels, and a generation of the brightest math and science minds on incremental improvements in mediocre email writing?

We can’t abandon work on improving A.I. The technology, however middling, is here to stay, and people are going to use it. But we should reckon with the possibility that we are investing in an ideal future that may not materialize.

Julia Angwin, a contributing Opinion writer and the founder of Proof News, writes about tech policy.

Celestica: Generative AI Boom Is Real - Prospects Remain Bright

  • Thanks to the promising demand commentaries from NVDA and SMCI, it is apparent that we are in the early innings of the generative AI infrastructure boom.
  • With continued demand strength for AI/ML compute and networking products from hyperscaler customers, we can understand why CLS has raised its FY2024 guidance.
  • Consensus forward estimates imply accelerated top- and bottom-line growth as well, thanks to its manufacturing ramp-up in Thailand/Malaysia and the shorter enterprise replacement cycle.
  • This is further aided by the robust Free Cash Flow generation and healthier balance sheet, allowing it to capitalize on the robust pricing/demand trends.
  • Despite the recent rally, we believe that CLS remains a compelling long-term Buy with double-digit upside potential.

We previously covered Celestica Inc. (NYSE: CLS) in March 2024, discussing how it had been a beneficiary of the ongoing generative AI boom, with the increased demand for ML/AI-related products likely to trigger accelerated top- and bottom-line growth through FY2026.

Despite the inherently lower-margin manufacturing business model, we believed that there remained great long-term growth opportunities, attributed to the ramp-up in Thailand/Malaysia operations, and we initiated a Buy rating then.

Since then, CLS has rallied by a further +27.5%, well outperforming the wider market at +1.6%. Even so, we are maintaining our Buy rating here, thanks to the promising demand commentaries from Nvidia's (NVDA) and Super Micro Computer's (SMCI) recent earnings calls.

This is further aided by the CLS management's promising FQ2'24 guidance, growing return on invested capital, and finally, the guidance of growth acceleration on a YoY basis in the coming quarters.

CLS' Investment Thesis Remains Robust, Thanks To NVDA's Impressive Earnings Results

For now, CLS has reported a double-beat FQ1'24 earnings call, with overall revenues of $2.2B (+7.8% QoQ/ +20.2% YoY), overall operating margins of 6.2% (+0.2 points QoQ/ +1 point YoY/ +3.5 points from FY2019 levels of 2.7%), and adj EPS of $0.86 (+22.8% QoQ/ +82.9% YoY).

Much of the tailwind is attributed to the robust performance observed in the Connectivity & Cloud Solutions segment, with accelerating revenues of $1.44B (+8.2% QoQ/ +38.4% YoY) and expanding operating margins of 7% (+0.3 points QoQ/ +1.2 points YoY/ +3.4 points from FY2019 levels of 2.6%).

Indeed, this is not a surprising development, thanks to the “continued demand strength for AI/ML compute and networking products from hyperscaler customers.”

This is especially since NVDA recently reported impressive Q1'24 revenues of $26.04B (+17.8% QoQ/ +262.1% YoY) while guiding Q2'24 revenues of $28B (+7.5% QoQ/ +107.4% YoY), smashing consensus estimates of $22.03B and $26.8B, respectively.

The same has been reported by SMCI, a company offering complete server solutions, with Q1'24 revenues of $3.85B (+5.1% QoQ/ +200% YoY) while guiding Q2'24 revenues of $5.3B (+37.6% QoQ/ +143.1% YoY).

These developments continue to suggest the robust demand for generative AI infrastructures, thanks to their inherent leadership in data center chips and complete server solution markets.

At the same time, NVDA is already guiding for new AI chips every year instead of the previous cadence of every two years, implying an accelerated replacement rhythm and naturally triggering further tailwinds for Electronics Manufacturing Services [EMS] and Original Design Manufacturer [ODM] companies such as CLS.

This is especially since CLS is involved across the broader technology market, with the generative AI boom already triggering notable expansion in the sales of cloud-related electronic components.

As a result, it is a given that CLS has raised its FY2024 guidance to revenues of $9.1B (+14.3% YoY), operating margins of 6.1% (+0.5 points YoY), adj EPS of $3.30 (+35.8% YoY), and adj Free Cash Flow of $250M (+28.9% YoY).

This is up drastically from the previous numbers of $8.5B (+6.7% YoY), 5.75% (+0.15 points YoY), $2.70 (+11.1% YoY), and $200M (+3.1% YoY) offered in the FQ4'23 earnings call, further implying its ability to capitalize on the generative AI infrastructure boom.

The Consensus Forward Estimates (source: Tikr Terminal)

As a result, it is unsurprising that the consensus forward estimates have been raised again, with CLS expected to generate accelerated top- and bottom-line growth at a CAGR of +11.6%/ +22.2% through FY2025, compared to the previous estimates of +9%/ +15.4%, respectively.

Most importantly, despite the higher capex related to the manufacturing ramp-up in its expanded facilities in Thailand and Malaysia through 2025, the management continues to report robust Free Cash Flow generation of $65.2M (-22.1% QoQ/ +608.6% YoY) and expanding margins of 2.9% (-1 point QoQ/ +2.4 points YoY/ -2.2 points from FY2019 levels of 5.1%) in the latest quarter.

This suggests CLS' ability to sustainably fund its growing operations while maintaining a healthy balance sheet, with a net-debt-to-EBITDA ratio of 0.72x in FQ1'24, compared to 0.76x in FQ4'23, 1.12x in FQ1'23, and 0.68x in FQ4'19.

CLS Valuations (source: Seeking Alpha)

As a result, we can understand why the market has awarded CLS a premium FWD P/E valuation of 17.65x, higher than the 16.08x at the time of our previous article and its 1Y mean of 10.86x.

It is undeniable that CLS may be relatively expensive compared to its EMS and ODM competitors, including Flex Ltd. (FLEX) at a FWD P/E of 13.87x, Hon Hai Precision Industry Co., Ltd. (OTCPK:HNHPF) [2317:TW] at 10.65x, and Quanta Computer (OTC:QUCCF) at 21.80x.

Then again, the premium appears to be justified for now, due to CLS' accelerated profitable growth prospects over the next few years compared to FLEX at +2.7%/ +14.2% and HNHPF at +10.5%/ +15.1%, while nearing QUCCF at +34.6%/ +24.1%, respectively.

So, Is CLS Stock A Buy, Sell, or Hold?

CLS 5Y Stock Price (source: Trading View)

For now, CLS has charted a new peak beyond its early 2024 heights, while also running away from its 50/100/200-day moving averages.

Even so, based on the annualized FQ1'24 adj EPS of $3.44 (+22.8% QoQ/ +82.9% YoY) and the raised FWD P/E of 17.65x, the stock appears to be trading near our fair value estimate of $60.70.
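For readers following the math, the fair value estimate appears to be simply the annualized EPS multiplied by the forward P/E. A quick sketch, assuming only the figures quoted above:

    # Fair value = annualized adj EPS x forward P/E multiple
    adj_eps_q1 = 0.86                  # FQ1'24 adjusted EPS
    annualized_eps = adj_eps_q1 * 4    # = 3.44
    fwd_pe = 17.65
    fair_value = annualized_eps * fwd_pe
    print(round(fair_value, 2))        # ~60.72, consistent with the $60.70 estimate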

There remains an excellent upside potential of +21.2% to our long-term price target of $71.30 as well, as discussed in our previous article.

As a result, we are maintaining our Buy rating for the CLS stock, though with no specific recommended entry point since it depends on the individual investor's dollar cost average and risk appetite.

With the stock currently buoyed by the highly promising NVDA and SMCI guidance thus far, we believe that interested investors may want to wait for a moderate retracement to its previous trading range between the $41s and the $49s for an improved margin of safety.

Analyst’s Disclosure: I/we have a beneficial long position in the shares of NVDA either through stock ownership, options, or other derivatives. I wrote this article myself, and it expresses my own opinions. I am not receiving compensation for it (other than from Seeking Alpha). I have no business relationship with any company whose stock is mentioned in this article. The analysis is provided exclusively for informational purposes and should not be considered professional investment advice. Before investing, please conduct personal in-depth research and utmost due diligence, as there are many risks associated with the trade, including capital loss.
