
Original scientific article

https://doi.org/10.7906/indecs.18.2.2

A Framework for Designing Compassionate and Ethical Artificial Intelligence and Artificial Consciousness

Soumya Banerjee ; University of Oxford



pp. 85-95





Abstract

Intelligence and consciousness have fascinated humanity for a long time and we have long sought to replicate this in machines. In this work, we show some design principles for a compassionate and conscious artificial intelligence. We present a computational framework for engineering intelligence, empathy, and consciousness in machines. We hope that this framework will allow us to better understand consciousness and design machines that are conscious and empathetic. Our hope is that this will also shift the discussion from fear of artificial intelligence towards designing machines that embed our cherished values. Consciousness, intelligence, and empathy would be worthy design goals that can be engineered in machines.

Keywords

artificial intelligence, artificial consciousness, engineering intelligence, empathy

Hrčak ID:

238034

URI

https://hrcak.srce.hr/238034

Publication date:

15.5.2020.





INTRODUCTION

Consciousness has intrigued humanity for centuries. It is only now with the emergence of complex systems science and systems biology that we are beginning to get a deeper understanding of consciousness. Here we introduce a computational framework for designing artificial consciousness and artificial intelligence with empathy. We hope this framework will allow us to better understand consciousness and design machines that are conscious and empathetic. We also hope our work will help shift the discussion from a fear of artificial intelligence towards designing machines that embed our values in them. Consciousness, intelligence and empathy would be worthy design goals that can be engineered in machines.

ARCHITECTURE FOR ARTIFICIAL INTELLIGENCE

In this section we outline a computational architecture for intelligent machines that can be considered to be conscious and empathetic. The key components of this architecture are described below and shown in Figure 1. Each component is also described in detail in the subsequent sections.

Figure 1. Computational architecture of artificial intelligence with consciousness and empathy.
indecs-18-85-g1.png

CORE COMPUTATIONAL ENGINE

The core engine is a deep learning framework that is able to analyze data and formulate concepts. The different hidden layers of the neural network will represent concepts. For example, say the deep learning framework analyzes data from an oscillating pendulum. It will build an internal model representation of this data and discover an invariant like the period of oscillation (Figure 2)[1]. The deep learning framework will also analyze this data and form a concept of the period of a pendulum. This will be reinforced by human input on what that concept actually means and by parsing the corresponding definition from an open-source knowledge corpus (like Wikipedia) (shown in Figure 2). We hypothesize that the more these machines connect these disparate sources of information and formulate concepts, the more knowledge they will have and the more intelligent and conscious they will become. These machines will also collaborate with other machines and human operators to communicate and share concepts; together they will build on these concepts to generate additional insight and knowledge. Our hope is that humans and networks of machines can collaborate, leading to a collective amplified intelligence.

INTELLIGENCE

The machine will assimilate knowledge from supervised human input and an open-source knowledge corpus (like Wikipedia). It will use a deep learning framework to analyze data and formulate concepts (the different hidden layers will represent concepts). It will then generate internal models, experiment with variations of these models, and formulate concepts to explain them (similar to a previous framework[1]). Let us again assume that the machine is given data on an oscillating pendulum. It will form an internal model and mutate that model (using genetic programming or Bayesian techniques) whilst ensuring the model predictions match the empirical data (Figure 2). It will then analyze this to find invariants (quantities that do not change), like the period of oscillation of a pendulum. Performing this in a deep learning framework, the hidden layers of the network will represent the concept of a period of oscillation. These concepts will be fed into the explainability module, which will translate them into a human-understandable format. Human operators will then reinforce these concepts: operators can help generalize the concept of a period of oscillation to other dynamical systems (similar to how a teacher would teach this subject).
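To make the mutate-and-check loop above concrete, the following minimal Python sketch mutates a small-angle pendulum model until it matches (synthetic) observations and then reads off the invariant, the period of oscillation. The model class, mutation scheme and data here are illustrative assumptions, not a prescribed implementation.

```python
# A minimal sketch of the mutate-and-check loop, assuming noisy angle
# measurements of a pendulum. All models and data are illustrative.
import numpy as np

rng = np.random.default_rng(0)

# Synthetic observations: a pendulum with unknown angular frequency.
true_omega = 2.0
t = np.linspace(0, 10, 200)
theta_obs = 0.1 * np.cos(true_omega * t) + rng.normal(0, 0.005, t.size)

def simulate(omega):
    """Candidate internal model: small-angle pendulum solution."""
    return 0.1 * np.cos(omega * t)

def fitness(omega):
    """How well the mutated model matches the empirical data."""
    return -np.mean((simulate(omega) - theta_obs) ** 2)

# Mutate the model parameter and keep improvements (a crude stand-in
# for genetic programming over whole model structures).
omega = 1.0
for _ in range(5000):
    candidate = omega + rng.normal(0, 0.05)
    if fitness(candidate) > fitness(omega):
        omega = candidate

# The surviving model exposes an invariant: the period of oscillation.
print(f"estimated period: {2 * np.pi / omega:.3f}"
      f" (true: {2 * np.pi / true_omega:.3f})")
```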

Figure 2. A Bayesian approach to artificial intelligence. The framework gets prior model specifications from human operators, mutates the models and checks consistency with empirical data. The hidden layers of the deep learning network represent a concept (for example, the invariant of a dynamical system). This concept is further reinforced by human operators (who help generalize that concept) and input from an open-source knowledge corpus.
indecs-18-85-g2.png
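The Bayesian variant of Figure 2 can be sketched in the same toy setting. Below, a Metropolis-Hastings sampler plays the role of model mutation, and a Gaussian prior stands in for the prior model specification supplied by human operators; all distributions and parameter values are illustrative assumptions.

```python
# A minimal Metropolis-Hastings sketch of the Bayesian variant in Figure 2,
# using the same toy pendulum data as above. All numbers are illustrative.
import numpy as np

rng = np.random.default_rng(1)
t = np.linspace(0, 10, 200)
theta_obs = 0.1 * np.cos(2.0 * t) + rng.normal(0, 0.005, t.size)

def log_prior(omega):
    # Operator belief: omega is near 1.5, loosely held.
    return -0.5 * ((omega - 1.5) / 0.5) ** 2

def log_likelihood(omega):
    residual = theta_obs - 0.1 * np.cos(omega * t)
    return -0.5 * np.sum((residual / 0.005) ** 2)

def log_posterior(omega):
    return log_prior(omega) + log_likelihood(omega)

# Mutate the model (propose) and accept or reject against the data.
omega, samples = 1.5, []
for _ in range(20000):
    proposal = omega + rng.normal(0, 0.02)
    if np.log(rng.uniform()) < log_posterior(proposal) - log_posterior(omega):
        omega = proposal
    samples.append(omega)

posterior = np.array(samples[5000:])  # discard burn-in
print(f"posterior omega: {posterior.mean():.3f} +/- {posterior.std():.3f}")
```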

We note that this module can also be implemented in a Bayesian setting (Figure 2). The different models can be varied or mutated by using Markov chain Monte Carlo in a non-parametric Bayesian model. Additional information can be provided by human operators through the priors of a Bayesian model[2].

EMPATHY MODULE

The empathy module would build and simulate a minimal mental model of others (robots or humans) so that they can be understood and empathized with. We note that this is a computationally expensive module (see Section Designing Empathy in Machines). This would require enforcing some constraints on how much time is spent on processing that information.

CONSCIOUSNESS MODULE

We define consciousness as what information processing feels like in a complex system (defined in detail in Section An Information Theoretic View of Artificial Consciousness). The consciousness module (Figure 4) is a deep learning network with feedback to analyze itself. The critical factor is that the module would feed back into itself. It would also need inputs from the intelligence module. We argue, like others before[3], that consciousness is what information processing feels like. Through learning and human feedback, consciousness can also be learnt over time (the bicameral theory of mind and consciousness[4]). We hypothesize that this proposed engineered system would build up consciousness over time. Communication with other robots and humans is also a critical component in order to build a collective intelligence. These machines will communicate with other artificially intelligent machines. Finally, this module will have agents (as in agent-based models) combined with machine learning or reinforcement learning. Consciousness will be an emergent property of this system.

DREAM MODULE

The dream module is a sandbox environment to simulate all other modules with no inhibitions. This is similar to DeepDream[5][6] with feedback into itself. This module also has connections to the experiment module.

EXPERIMENT MODULE

The experiment module will play or experiment with systems in a protected sandbox environment (similar to another framework[1]). The input to this module would be data on a particular system of interest. The module would make a model, perturb this model, and observe how well it is consistent with data. The output of this module would be fed into a neural network to form concepts (the hidden layers of the deep learning network would represent concepts).

EXPLAINABILITY MODULE

This module will run interpretable machine learning algorithms to allow human operators to understand actions taken by the machine and get insights into the mechanisms behind the learning.
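As one illustration of what the explainability module could run, the sketch below scores input features by perturbing them and measuring the change in a model's output. This is a generic perturbation-based attribution technique, not the authors' specified algorithm, and the model is a hypothetical stand-in for a trained network.

```python
# A minimal sketch of perturbation-based attribution for the explainability
# module. The "model" is a placeholder for any trained network.
import numpy as np

def model(x):
    # Hypothetical trained predictor (stand-in for a deep network).
    return 3.0 * x[0] - 0.5 * x[2]

def attribution(x, eps=1e-3):
    """Score each input feature by how much perturbing it moves the output."""
    base = model(x)
    scores = []
    for i in range(len(x)):
        perturbed = x.copy()
        perturbed[i] += eps
        scores.append((model(perturbed) - base) / eps)
    return scores

x = np.array([1.0, 2.0, 3.0])
for i, s in enumerate(attribution(x)):
    print(f"feature {i}: influence {s:+.2f}")
```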
UNSUPERVISED LEARNING

We propose that these systems should also incorporate some information from curated repositories like the Wikipedia knowledge corpus, similar to what was done for the IBM Watson project.

SUPERVISED LEARNING

The concepts formed by the engine and the output of other modules will be tested by humans. Humans will interact with the explainability module to understand why particular actions were taken or concepts formed. Human operators would ensure that these machines have a broad goal or purpose and that all their actions are consistent with some ethical structure (like Asimov's Laws of Robotics). Human operators will also try to minimize harm to the machines themselves. We expect that different machines will form distinct personalities based on their system state, data and human input. This is an opportunity for us to personalize these machines (if used in homes). This step is also the most vulnerable: humans with malicious intent can embed undesirable values in machines, and hence considerable care should be taken in supervising these machines.

MEMORY

Finally, the machine will have memory. It will record all current and previous states of its artificial neural network, its learning representations and its interactions with human operators. It will have the capability to play back these memories (in a protected sandbox environment), perform role-playing and simulate future moves. This will need to be done in the dream module. Our hope is that machines will be able to use past memories to learn from them, generate new future scenarios from past data and train on this data. These "reveries" will allow the machines to effectively use data on past actions to generate new knowledge. This reverie architecture generates new knowledge by combining stored information with a capability to process that information (Figure 3). In a certain sense, these machines will be able to learn from past mistakes and adapt to different scenarios. We hypothesize that this will lead to higher levels of intelligence and ultimately to a form of consciousness.

Figure 3. Using past memory to generate new knowledge. Past data is used to generate training data which is combined with new models. This yields new insights and knowledge into a system of interest. This “reverie” architecture generates new knowledge by combining stored information with a capability to process that information.
indecs-18-85-g3.png
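A minimal sketch of this reverie mechanism, under illustrative assumptions, is given below: recorded episodes are replayed with small variations to produce new training scenarios. The data structures and mutation rule are hypothetical, not part of the original framework.

```python
# A minimal sketch of the "reverie" idea in Figure 3: stored episodes are
# replayed in a sandbox with small variations to generate fresh training data.
import random

memory = []  # recorded (state, action, outcome) triples

def record(state, action, outcome):
    memory.append((state, action, outcome))

def reverie(n_dreams, mutate):
    """Replay past episodes with small variations to create new scenarios."""
    dreams = []
    for _ in range(n_dreams):
        state, action, outcome = random.choice(memory)
        dreams.append((mutate(state), action, outcome))
    return dreams

# Record a few real interactions, then dream up variations to train on.
for s in range(5):
    record(s, "probe", s * 2)

training_data = reverie(10, mutate=lambda s: s + random.uniform(-0.5, 0.5))
print(training_data[:3])
```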
CONSCIOUSNESS, INTELLIGENCE AND LIFE: PERSPECTIVES FROM INFORMATION PROCESSING

We hypothesize that consciousness, intelligence and life are different forms of information processing in complex systems[7]. Information and the computational substrate needed to process that information serve as the basis of life, intelligence and consciousness. The minimal computational unit needed to create this artificial intelligence can be an artificial neuron, reaction diffusion computers[7][8] or neuromorphic computing systems[9]. Any of these computing substrates can be used to implement the architecture shown in Figure 1.
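As an illustration of this substrate independence (an assumption of the framework, not a tested claim), the modules of Figure 1 could be written against an abstract computing interface, so that an artificial neuron, a reaction-diffusion computer or a neuromorphic system could in principle back the same architecture. The sketch below is entirely hypothetical.

```python
# A minimal sketch of substrate independence: the architecture only ever
# talks to an abstract interface, so different substrates are interchangeable.
from abc import ABC, abstractmethod

class Substrate(ABC):
    """Anything that can process information."""
    @abstractmethod
    def process(self, signal: float) -> float: ...

class ArtificialNeuron(Substrate):
    def process(self, signal: float) -> float:
        return max(0.0, signal)  # ReLU-style unit

class ReactionDiffusionCell(Substrate):
    def process(self, signal: float) -> float:
        return signal / (1.0 + abs(signal))  # saturating chemical response

def core_engine(substrate: Substrate, inputs):
    # The modules of Figure 1 only depend on the abstract interface.
    return [substrate.process(x) for x in inputs]

print(core_engine(ArtificialNeuron(), [-1.0, 0.5]))
print(core_engine(ReactionDiffusionCell(), [-1.0, 0.5]))
```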

AN INFORMATION THEORETIC VIEW OF ARTIFICIAL CONSCIOUSNESS

Consciousness is characterized by feedback loops in a complex system: consciousness is what information processing feels like when there are feedback loops in a complex system that processes information[1][3][10][11]. Consciousness has also been hypothesized to be an emergent property of a complex system[12]. It is like asking what makes water liquid; liquidity is not a property of a single water molecule but an emergent property of the entire system of billions of water molecules. Hence, we hypothesize that consciousness is also an emergent property of a complex information-processing system with feedback.

DESIGNING CONSCIOUSNESS IN MACHINES

How can we encode these principles and design consciousness in a computer? A tentative basic definition of a conscious machine is "a computing unit that can process information and has feedback into itself". An architecture for a consciousness module is presented below and shown in Figure 4. Would a computer recognize that it is a computer? We can show a computer images of other computers to help it recognize itself (using deep learning based image recognition algorithms). We can also, for example, show the machine images of smartphones, birds and buildings to reinforce the concept that it is not any of these things (non-self). Finally, we can design an algorithm to select out all images of non-self; all that remains is self. This kind of algorithm can be used to design a sense of self in machines. Such a supervised learning approach is similar to negative selection in biology[13], where the immune system learns to discriminate between all cells in the body (self) and all that is foreign and potentially pathogenic (non-self). A complementary approach is to exhaustively define all qualities that uniquely define self, such as the size of the computer, its colour, the amount of memory, identification marks, the memory of past actions taken by the machine, etc. We can also show the computer an image of itself. All these attributes can then be coupled to a deep-learning based image recognition or self-recognition program. The different hidden layers of a deep learning module would encapsulate the concept of self (based on images or other attributes). Both strategies can be combined to design a basic level of self-awareness and consciousness in computers (Figure 4); a sketch of the negative-selection approach follows Figure 4. We see some parallels to the turtle robots designed by William Grey Walter that could sense, move, and recharge[14]. These simple robots could follow a light source with a sensor, move around and then recharge their batteries when required. In one experiment, a light source was placed on top of a robot and the robot itself was placed in front of a mirror. It was claimed that the robot began to twitch; W.G. Walter suggested that this robot showed a simple form of self-awareness[14]. We also argue that this is a basic level of consciousness and self-awareness. Similar principles can be used to design consciousness in machines. Finally, we note that there are some well-known cognitive architectures that can be used to implement this form of artificial intelligence[15].

Figure 4. Architecture of the consciousness module. An artificial neural network analyzes the input and output of the core engine and all other modules (in Figure 1). This is combined with all the attributes of this particular machine (colour, physical characteristics, the fact that it is not a smartphone, it is not a bird, etc.). The hidden layers of the artificial neural network represent the concept of self.
indecs-18-85-g4.png
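A minimal sketch of the negative-selection approach described above, with feature vectors standing in for images, might look as follows; the detectors, feature values and threshold are illustrative assumptions.

```python
# A minimal sketch of negative selection: anything that matches a library of
# labeled non-self examples is rejected; whatever remains is treated as self.
import numpy as np

# Detectors trained on non-self exemplars (smartphones, birds, buildings...).
non_self_detectors = np.array([
    [0.9, 0.1, 0.0],   # "smartphone-like" features
    [0.1, 0.9, 0.2],   # "bird-like" features
    [0.2, 0.1, 0.9],   # "building-like" features
])

def is_self(features, threshold=0.5):
    """Self is whatever no non-self detector matches."""
    distances = np.linalg.norm(non_self_detectors - features, axis=1)
    return bool(np.all(distances > threshold))

own_image = np.array([0.4, 0.5, 0.4])       # the machine's view of itself
phone_image = np.array([0.85, 0.15, 0.05])  # close to a non-self detector

print(is_self(own_image))    # True: matches no non-self detector
print(is_self(phone_image))  # False: matched "smartphone-like"
```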

HUMAN SUPERVISION AND INTERPRETABILITY

Conscious machines will also need to explain themselves. The biological brain sometimes struggles to explain itself to others: people can find it hard to explain their actions or motives. Why do we love someone? Why do we feel afraid when we see a snake? The brain is a complex information-processing system that does not lend itself very well to explanation. Some progress has been made in making artificial neural networks interpretable[16][17]. These are approaches in which artificial neural networks are turned on themselves to analyze their own actions. Similar techniques can be used to implement an explainability module (Figure 1) that can explain specific actions taken by the machine to human operators. Making artificial intelligence interpretable or explainable will help human operators understand machines and help guide their training[18].

DESIGNING EMPATHY IN MACHINES

Empathy is when we try to deeply understand another person. If the brain is like a Turing machine, then empathy is similar to running another Turing machine within it. Simulating a Turing machine with another Turing machine and asking whether it will ever halt is undecidable (the halting problem). We hypothesize that, in general, empathy is undecidable. It is also computationally expensive, which is perhaps why biological organisms do not have a lot of empathy. Empathy is also intimately connected with a sense of self. Having a sense of self is essential for survival, and this may be why, evolutionarily, it is important to have consciousness. There are people, called synesthetes, who have a heightened sense of compassion for other people. They feel intense emotions and empathy for other people, to the point where human interactions exhaust them and they can become homebound. Essentially, they are simulating other people and feeling what other people are feeling. They also find it difficult to separate their own self from other people. This suggests why we have a sense of self: we hypothesize that having a sense of self aids survival and delineates self from prey or predator. This may also be why we do not have a lot of empathy: if we did, we would not have a strong sense of self and might be at a selective disadvantage. Empathy and consciousness are also related. The ability to run a simulation of what another person is feeling (simulating another person's mental state) is empathy. Apart from being undecidable in general, empathy is also inversely related to a sense of self and hence may be at a selective disadvantage. Evolution may have decided that a lot of empathy is not good for individual survival. However, we have the unique opportunity of being able to engineer machines that have more empathy than biological organisms. We suggest the possibility of programming empathy in a computer. We may have to impose limits on how much time is spent simulating another person's or another machine's state. In general this is undecidable, but we may be able to implement fast approximations.
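A hedged sketch of such a bounded simulation is given below: the machine refines an inferred mental state of another agent only until a fixed compute budget is exhausted, returning a fast approximation rather than an exact (and in general undecidable) answer. The update rule and state representation are hypothetical.

```python
# A minimal sketch of bounded empathy: simulating another agent's state is
# cut off by a fixed time budget, yielding a fast approximation.
import time

def simulate_other(observations, budget_seconds=0.05):
    """Run a model of another agent, but only within a time budget."""
    deadline = time.monotonic() + budget_seconds
    inferred_state = {"valence": 0.0}
    steps = 0
    mean_obs = sum(observations) / len(observations)
    # Refine the inferred mental state until the budget is exhausted.
    while time.monotonic() < deadline:
        inferred_state["valence"] = (0.9 * inferred_state["valence"]
                                     + 0.1 * mean_obs)
        steps += 1
    return inferred_state, steps  # an approximation, not an exact answer

state, steps = simulate_other([0.3, -0.1, 0.5])
print(f"inferred state after {steps} refinement passes: {state}")
```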
DISCUSSION

We present a computational framework for engineering intelligence, empathy and consciousness in machines. This architecture can be implemented on any substrate that is capable of computing and information processing[7][8][19][20][21][22][23][24][25][26][27][28][29][30][31][32][33][34]. We tentatively define consciousness as what information processing feels like in a complex system[3]. Consciousness is also like having a sense of self and is an emergent property of a complex information-processing system with feedback. Our proposed architecture for intelligent, conscious and empathetic machines will assimilate knowledge using supervised learning, form concepts (using a deep learning framework) and experiment in a sandbox. Our proposed machines will be capable of a form of consciousness by using feedback. They will also be capable of empathy by simulating the artificial neural state or conditions of other machines or humans. We hypothesize that empathy and emotions have been pre-programmed over evolution. Empathy may confer an evolutionary advantage; we also recognize why too much empathy can be a disadvantage. Empathy can be achieved in robots through operator training (reinforcement learning) and by allowing machines to analyze the artificial neural state of other machines or the personal history of humans. We have the unique opportunity of being able to engineer machines that may have more empathy than biological organisms. Communication technologies and human supervision can help accelerate the onset of artificial consciousness, much as communication was hypothesized to have led to the emergence of consciousness in humans[4]. Consciousness can be learnt over time, as has been hypothesized before[4]. We suggest a computational approach to engineering this in machines with close human supervision. Ultimately, we may be able to engineer higher levels of consciousness: more levels of feedback and more complexity in information processing may lead to higher levels of consciousness. The union of machine intelligence with our biological intelligence may also give us access to higher levels of consciousness. Computing paradigms that are not constrained by physical space or that use different computing substrates (as proposed for other information-processing systems[7][35] and in biology[20][21][22][23][24][25][26][27][28][29][30][31][32][33][34]) may be capable of higher levels of consciousness. Our greatest contribution as a species may be that we introduce non-biological consciousness into the Universe.

We foresee a number of dangers. The scope of this computational framework (presented in Figure 1) is very broad and may currently be beyond the reach of individuals, feasible only for giant corporations. Malicious corporations and conglomerates of individuals may misuse such an artificial intelligence, for example by failing to invest in empathy. It may be worthwhile to create non-profits that advocate for designing empathy in future intelligent machines and that educate the public about the potential benefits of such technologies. Another danger is that we mistreat these artificial creations. What ethics might we need to create for conscious machines[36]? Would it be ethical to turn off or destroy such an artificially intelligent and conscious being? The creation of artificially conscious and intelligent machines will challenge us to come up with new ethical structures. It may be the first time that consciousness has been engineered rather than allowed to self-emerge, and these beings would deserve as much sympathy as we show towards other species.

We hope that this framework will allow us to better understand consciousness and design machines that are conscious and empathetic. We hope this will also shift the discussion from a fear of artificial intelligence towards designing machines that embed our cherished values in them.
Consciousness, intelligence and empathy would be worthy design goals that can be engineered in machines.

ACKNOWLEDGEMENTS

The author would like to thank Irene Egli, Chayan Chakrabarti, Tarakeswar Banerjee, Kalyani Banerjee, Linda Iglehart, Peter Grindrod, Franziska Diethelm and Joyeeta Ghose for fruitful discussions.

References

1

Schmidt M. and Lipson H. 2009. Distilling free-form natural laws from experimental data. Science, 324(5923), 81-85.

2

Banerjee S. 2017. Modelling the effects of phylogeny and body size on within-host pathogen replication and immune response. Journal of The Royal Society Interface, 14(136), 1-17. https://doi.org/10.1098/rsif.2017.0479

3

Tegmark M. 2015. Consciousness as a State of Matter. Chaos, Solitons & Fractals, 76, 238-270. https://doi.org/10.1016/j.chaos.2015.03.014

4

Jaynes J. The origin of consciousness in the breakdown of the bicameral mind. Boston: Houghton Mifflin Harcourt, 1976.

5

Mordvintsev A. 2015. Inceptionism: Going Deeper into Neural Networks. Google Research Blog. Accessed 30th December 2017. https://research.googleblog.com/2015/06/inceptionism-going-deeper-into-neural.html

7

Banerjee S. 2016. A Roadmap for a Computational Theory of the Value of Information in Origin of Life Questions. Interdisciplinary Description of Complex Systems, 14(3), 314-321. https://doi.org/10.7906/indecs.14.3.4

8

Adamatzky A. 2002. Experimental logical gates in a reaction-diffusion medium: The XOR gate and beyond. Physical Review E, 66(4).

9

Schemmel J. A wafer-scale neuromorphic hardware system for large-scale neural modeling. Piscataway: Institute of Electrical and Electronics Engineers, 2010.

10

Kaku M. The future of the mind: The scientific quest to understand, enhance, and empower the mind. New York: Anchor Books, 2015.

11

Chalmers D.J. The character of consciousness. Oxford: Oxford University Press, 2010.

12

Hofstadter D.R. Gödel, Escher, Bach: An eternal golden braid. New York: Basic Books, 1999.

13

Janeway C.A. Immunobiology: The immune system in health and disease. New York: Garland Science, 2005.

14

Boden M. 2006-2007. Grey Walter's Anticipatory Tortoises. The Rutherford Journal, 2. http://rutherfordjournal.org/article020101.html

15

University of Michigan Soar Group. Accessed 30th December 2017. https://github.com/SoarGroup

16

Salakhutdinov R. Restricted Boltzmann machines for collaborative filtering. New York: Association for Computing Machinery, 2007.

17

Yosinski J. Understanding neural networks through deep visualization. New York: Cornell University, 2015.

18

Bornstein A.M. 2016. Is Artificial Intelligence Permanently Inscrutable? Nautilus, 40. http://nautil.us/issue/40/Learning/is-artificial-intelligence-permanently-inscrutable

19

Vallverdu J. 2018. Slime mould: the fundamental mechanisms of cognition. Biosystems, 165, 57-70. https://doi.org/10.1016/j.biosystems.2017.12.011

20

Levin D. 2016. A spatial model of the efficiency of T cell search in the influenza-infected lung. Journal of Theoretical Biology, 398, 52-63. https://doi.org/10.1016/j.jtbi.2016.02.022

21

Banerjee S. 2016. Estimating biologically relevant parameters under uncertainty for experimental within-host murine West Nile virus infection. Journal of The Royal Society Interface, 13(117). https://doi.org/10.1098/rsif.2016.0130

22

Balch C. 2014. Science and technology consortia in US biomedical research: A paradigm shift in response to unsustainable academic growth. BioEssays, 37(2), 119-122. https://doi.org/10.1002/bies.201400167

23

Banerjee S. A Multi-Agent System Approach to Load-Balancing and Resource Allocation for Distributed Computing. Complex Systems Digital Campus 2015, 2015.

24

Banerjee S. Immune System Inspired Strategies for Distributed Systems. 2010.

25

Banerjee S. 2010. Scale Invariance of Immune System Response Rates and Times: Perspectives on Immune System Architecture and Implications for Artificial Immune Systems. Swarm Intelligence, 4, 301-318. https://doi.org/10.1007/s11721-010-0048-2

26

Banerjee S. 2016. Estimating biologically relevant parameters under uncertainty for experimental within-host murine West Nile virus infection. Journal of The Royal Society Interface, 13(117). https://doi.org/10.1098/rsif.2016.0130

27

Banerjee S. An Immune System Inspired Approach to Automated Program Verification. 2009.

28

Banerjee S. Scaling in the immune system. Albuquerque: University of New Mexico, 2013.

29

Banerjee S. 2016. A Biologically Inspired Model of Distributed Online Communication Supporting Efficient Search and Diffusion of Innovation. Interdisciplinary Description of Complex Systems, 14(1), 10-22. https://doi.org/10.7906/indecs.14.1.2

30

Banerjee S. 2017. A computational technique to estimate within-host productively infected cell lifetimes in emerging viral infections. Interdisciplinary Description of Complex Systems, 15(3), 190-198. https://doi.org/10.7906/indecs.15.3.3

31

Banerjee S. 2017. An artificial immune system approach to automated program verification: Towards a theory of undecidability in biological computing. PeerJ Preprints, 5. https://doi.org/10.7287/peerj.preprints.2690v3

32

Banerjee S. 2017. An Immune System Inspired Theory for Crime and Violence in Cities. Interdisciplinary Description of Complex Systems, 15(2), 133-143. https://doi.org/10.7906/indecs.15.2.2

33

Banerjee S. Analysis of a Planetary Scale Scientific Collaboration Dataset Reveals Novel Patterns. First Complex Systems Digital Campus World E-Conference. Springer, 2017.

34

Banerjee S. Optimal strategies for virus propagation. 2015.

35

Clarke A.C. The wind from the sun. Canada: Vista, 1996.

36

Sandberg A. 2014. Ethics of brain emulations. Journal of Experimental & Theoretical Artificial Intelligence, 26(3), 439-457. https://doi.org/10.1080/0952813X.2014.895113

