Human & Artificial Intelligence

Learning Outcomes

1. What is intelligence?

2. What are the origins of modern intelligence tests?

3. Explain some issues with intelligence testing.

4. What are some different views on the nature of intelligence?

5. How do factors other than intelligence contribute to success?

6. Describe the beginnings of artificial intelligence (AI). What is the Turing Test?

7. What is an expert system? How is common sense important to AI?

8. Describe artificial neural networks, and deep learning in particular

9. What are some successful real-world applications of AI?

10. What is the potential future of AI? How does AI relate to consciousness?

What is Intelligence?

• “______ _____”?

• “____________ as a measurable capacity must at the start be defined as the capacity to do well in an intelligence test. Intelligence is what the tests test” (Boring, 1923, p.35)

• an inferred trait, representing the abilities to learn from experience, acquire knowledge, think abstractly, and adapt to changes

The First Tests

Francis Galton (1822-1911):

- ran a service at South Kensington Museum in London

- would check your intelligence for a fee

- tests mostly __________

- pros & cons:

pioneer in the study of intelligence

contributed much to statistics

concluded success was due to ________

started the ________ movement: improving humanity’s physical and mental composition by selective parenthood

Alfred Binet & Theodore Simon (1905/1916):

- developed a test to identify ____ learners

- found they performed at the level of younger children

- compared ______ age (MA) with chronological age (CA)

- components:

• __________ e.g., “What does misanthrope mean?”

• _____________ e.g., “Why do people sometimes borrow money?”

• ______ relations e.g., “What do an orange, an apple, and a pear have in common?”

- assumptions:

• test shows current performance differences

• purpose: identify children who need special ____

• believed special training could help slow learners catch up

• test not based on a theoretical conception of “____________”

Lewis Terman (1916):

- modified test for use in US (Stanford-Binet Intelligence Scales)

- based on Stern’s (1912) intelligence ________:

IQ = (MA / CA) × 100

- test fine-tuned to make mean score = 100, standard deviation = 15

- pros & cons:

easy to administer and score

hard to _______ children of different chronological ages

could not be applied to ______

test was largely language-based
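
As a worked illustration of Stern's ratio formula above, here is a minimal Python sketch. The function name `ratio_iq` and the example ages are assumptions made for illustration; they are not taken from the Stanford-Binet materials.

```python
def ratio_iq(mental_age: float, chronological_age: float) -> float:
    """Ratio IQ as given above: (MA / CA) x 100."""
    if chronological_age <= 0:
        raise ValueError("chronological age must be positive")
    # Multiply first so whole-number ages give exact results.
    return 100 * mental_age / chronological_age


# A 10-year-old performing like a typical 12-year-old:
print(ratio_iq(12, 10))  # 120.0
# A 10-year-old performing like a typical 8-year-old:
print(ratio_iq(8, 10))   # 80.0
# Performance that matches chronological age gives the mean score:
print(ratio_iq(10, 10))  # 100.0
```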

David Wechsler (1939):

- developed tests:

• Wechsler Adult Intelligence Scale (WAIS)

• Wechsler Intelligence Scale for Children (WISC)

• Wechsler Preschool and Primary Scale of Intelligence (WPPSI)

- provide verbal, performance, and overall scores, plus IQ-equivalents

- included more _________ questions

Issues

• ________

James McKeen Cattell (1890):

- devised tests based on Galton’s work

- assessed by his student, Clark Wissler

- virtually no ___________ between intelligence and college grades

(conventional IQ tests: correlations of .40 to .60)

• tests used extensively during WWI

- Army Alpha: verbal test

- Army Beta: performance test, with __________ directions instead of words

- used to determine job classification and leadership potential (or, who got sent to the front lines)

• intelligence came to be seen as inherited _____, not index of learning performance

• tests __________ biased:

e.g., “I don’t sing for nobody” Single- or double-negative?

e.g., Dove Counterbalance General Intelligence Test (1971), a.k.a. the Chitling Test

- Is one culture less intelligent, or is the test ______?

- Some cultures don’t believe in testing or competitive _______--should they be exempt?

- Do results reveal a lack of _______ important in our society?

- Is the ______ for a low score important?

• most intelligence tests are ____________: depend on the number of correct answers

- possible solution: use other kinds of _____, as well

- check for physiological or psychological problems

- diagnose specific learning disability

• _____ effect (James R. Flynn, 1987; 2000; 2007):

- unstandardized intelligence scores have been increasing over time

▸ 14-point gain from 1932 to 1978

▸ annual gain from 1947-1972 was 0.31 points (0.36 in the 1990s)

▸ greatest increases seen in lower grades, but not for children in grade 12 (limit?)

▸ not all subtests increase equally, thus intelligence is not a ______ entity

- the nature of intelligence has changed from practical to conceptual, due to social change

e.g., What do dogs and rabbits have in ______?

- increases in IQ are not due to changes in genetics--they must be _____________

▸ influence of television, schooling, parenting, etc.

▸ kids watching basketball games on TV could learn new moves and improve their performance to match star players; their improved performance would challenge and enrich the playing of other kids on the court

▸ jobs increasingly required more logical reasoning and analysis, so schooling focused more on logical reasoning and analysis and more people complete high school, leading to generational changes in abstract reasoning

- this is called the social multiplier effect: a virtuous cycle of skill improvement
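
The Flynn-effect figures quoted above can be checked with simple arithmetic. The sketch below assumes the gain is spread evenly over the period, which is a simplification; the function name `average_annual_gain` is hypothetical.

```python
def average_annual_gain(total_gain: float, start_year: int, end_year: int) -> float:
    """Average points gained per year, assuming a constant rate."""
    return total_gain / (end_year - start_year)


# 14-point gain from 1932 to 1978 (46 years):
rate = average_annual_gain(14, 1932, 1978)
print(round(rate, 2))  # 0.3

# At the quoted 0.31 points per year, the 25 years from 1947 to 1972 add up to:
print(round(0.31 * (1972 - 1947), 2))  # 7.75
```

The two quoted figures are roughly consistent: about 0.30 points per year over the longer period, against 0.31 for 1947-1972.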

Other Intelligences

Charles Spearman (1927):

- is intelligence monolithic, or comprised of different factors?

- factor analysis: determines minimum number of dimensions necessary to explain a pattern of correlations among subtests

- correlated scores on verbal, quantitative, analytical subtests

- some evidence for a general “g” factor of intelligence

- correlations are not perfect; there are specific factors affecting each subtest

- pros & cons:

more _______ tasks (e.g., academic achievement, job performance) highly correlated with g

other theorists have proposed other intellectual abilities that are uncorrelated with each other
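
To make the idea of a single dimension underlying correlated subtests concrete, here is a rough Python sketch. It is not Spearman's procedure: it extracts the first principal component of a correlation matrix by power iteration, used here as a simplified stand-in for factor analysis. The correlation values are invented for illustration only.

```python
# Hypothetical correlations among three subtests (illustrative values only).
SUBTESTS = ["verbal", "quantitative", "analytical"]
R = [
    [1.0, 0.6, 0.5],
    [0.6, 1.0, 0.7],
    [0.5, 0.7, 1.0],
]


def dominant_component(matrix, iterations=200):
    """Power iteration: return (largest eigenvalue, its unit eigenvector)."""
    n = len(matrix)
    v = [1.0] * n
    eigenvalue = 0.0
    for _ in range(iterations):
        # Multiply the matrix by the current vector, then renormalize.
        w = [sum(matrix[i][j] * v[j] for j in range(n)) for i in range(n)]
        norm = sum(x * x for x in w) ** 0.5
        v = [x / norm for x in w]
        eigenvalue = norm
    return eigenvalue, v


eigenvalue, loadings = dominant_component(R)
# Share of total variance captured by the single dominant dimension.
share = eigenvalue / len(R)
for name, loading in zip(SUBTESTS, loadings):
    print(f"{name}: {loading:.2f}")
print(f"variance explained: {share:.0%}")
```

All three subtests load positively on the one dimension, but it does not capture all of the variance. The remainder corresponds to the specific factors mentioned above.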

Theory of Multiple Intelligences (Howard Gardner, 2008, 2011b):

- ________ for intelligence:

• isolated by brain damage

• existence of prodigies, savants

• identifiable core operations (e.g., music: melody, harmony, rhythm, etc.)

• distinctive developmental history

• evolutionary plausibility

• support from experimental psychology

• psychometric support

• encodable in a symbol system

- multiple intelligences:

• __________: poets, writers, linguists

• _______-mathematical: mathematicians, scientists, philosophers

• _______: composers, conductors, musicians

• spatial: architects, artists, navigators

• ______-kinesthetic: dancers, athletes, actors

• _____________ (understanding others) and intrapersonal (understanding oneself): psychiatrists, politicians, anthropologists

• naturalist: biologists, naturalists

• (maybe also existential: spiritual leaders?)

- how should these be used?

• descriptive, not ____________

• not in standardized test, but in real life or simulations

• for more effective ________ and assessment

(but there is virtually no evidence that matching type of instruction to “learning style” optimizes learning (Pashler et al., 2008))

- criticisms:

lacks empirical ________; rather, many of Gardner’s intelligences correlate with g

there is no test of multiple intelligences; assessment relies on subjective judgment

“intelligences” are more like “_________” or “aptitudes” (what’s the difference?)

Triarchic Theory of Successful Intelligence (Robert Sternberg, 1985):

1. ____________ (analytical) intelligence

a) ______________: recognizing a problem, selecting a procedure to solve it, checking the results

b) performance components: planning, implementing the procedure

c) _________-acquisition components: learning how to solve a problem

2. ____________ (creative) intelligence

• applying your knowledge on a specific task (automated or novel tasks)

• may require creativity/divergent thinking

3. __________ (practical) intelligence

• ability to a) adapt to, b) shape, or c) select one’s environment

- measured by STAT (Sternberg Triarchic Abilities Test)

- intelligence is more a matter of using what you’ve got, not how much you’ve got

- criticisms:

traditional intelligence tests correlate with income, occupational prestige, and ability to stay out of jail, which are supposedly measures of practical intelligence

Other Factors

Terman & colleagues: “Genetic Studies of Genius” (1925-1959)

- how to bridge gap between potential and achievement?

- 1,528 gifted children with IQ greater than 135 (top 1%) were followed starting in 1921

- compared most vs. least successful (school/work/ambition)

- no __ difference, but difference in __________

Duckworth & Seligman (2005):

- self-discipline in 8th-graders measured by self-report, parent report, teacher report, and monetary choice questionnaire

- self-discipline predicted final grades, attendance, standardized achievement test scores, selection into competitive high school program

- self-discipline was more important than IQ in contributing to final grades

Artificial Intelligence (AI): Beginnings

John von Neumann (1945):

- conceived of stored ________ program for controlling operations of computer hardware (vs. physical switches)

- headed team to develop machine to calculate artillery trajectories for Ballistics Research Laboratory

J. Presper Eckert & John W. Mauchly (1946):

- built _____: Electronic Numerical Integrator and Computer

- first electronic large-scale general-purpose digital computer

- first to handle __________ and numeric data

Alan Turing (1950):

- asked, “Can machines _____?”

- developed “the imitation game,” a.k.a. the Turing test:

• judge sits in a room, with an ______ in another room

• communicates with entity via keyboard

• is the entity human or not?

- not proposed as a test of ____________

Newell & Simon (1956):

- created Logic Theorist program

- designed to solve mathematical proofs, play games

- later created General Problem Solver

“Artificial intelligence” coined by John McCarthy in 1956

- programs were not “______”, but didn’t have to be

e.g., flying: birds vs. ornithopters vs. airplanes

Computer Vision

- Marvin Minsky, Terry Winograd attempted robotic vision

- used simplified version of the world: “_____ world”

- robots programmed to “see” and move blocks

- not very __________

e.g., tried to build tower from top down

AI & Language

- Weizenbaum (1966) wrote ELIZA, a virtual Rogerian therapist

• was merely a “bag of ______”

- Chamberlain & Etter (1984) created RACTER (short for “raconteur”)

• BASIC program, running on Z80 chip with 64K RAM

• RACTER “wrote” The Policeman’s Beard is Half Constructed, a collection of ______ and prose

- these are mere “___________”, with primitive knowledge bases; understanding language is hard
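
A minimal sketch of how a program in the style of ELIZA can appear to converse without any knowledge of language: it matches keywords and fills canned reply templates. The rules below are hypothetical and are not Weizenbaum's original script.

```python
import re

# Hypothetical keyword rules: (pattern, reply template).
RULES = [
    (re.compile(r"\bi feel (.+)", re.IGNORECASE), "Why do you feel {0}?"),
    (re.compile(r"\bi am (.+)", re.IGNORECASE), "How long have you been {0}?"),
    (re.compile(r"\bmy (mother|father)\b", re.IGNORECASE), "Tell me more about your {0}."),
]
DEFAULT_REPLY = "Please go on."


def respond(utterance: str) -> str:
    """Return the reply for the first matching rule, else a stock phrase."""
    text = utterance.strip().rstrip(".!?")
    for pattern, template in RULES:
        match = pattern.search(text)
        if match:
            # Echo the user's own words back inside the template.
            return template.format(match.group(1))
    return DEFAULT_REPLY


print(respond("I feel anxious about exams."))  # Why do you feel anxious about exams?
print(respond("The weather is nice."))         # Please go on.
```

Nothing in the program represents what "anxious" or "exams" means; it only rearranges the input text.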

Machine Translation

- could work word-by-word (sort of)

- plagued by problems due to inherent _________ of language

e.g., “Mary saw the bicycle in the store window, and she wanted it.”

vs. “Mary saw the bicycle in the store window; she looked at it longingly and pressed her nose up against it.”

- a sentence in a technical journal had 1+ _______ syntactically correct interpretations

Criticisms

Moravec’s paradox (1988):

- for computers, “hard things are easy, and easy things are hard”

- things most people find difficult can easily be done by computers (e.g., playing chess or doing calculus)

- however, abilities that humans find easy or even trivial are very difficult for computers to do (e.g., object perception, understanding context in conversations)

Dreyfus (1972):

- wrote What computers can’t do: A critique of artificial reason

- problems due to fundamental ___________ between humans and computers:

• consciousness

• body to unitize sensory experience

• fatigue, boredom, drive

• intentionality (sense of _______)

Lighthill (1972):

- 20 years of research into AI had been a great ______________

Result: “AI ______” of decreased funding and research.

Classical AI: Expert Systems

- everyday, generalized intelligence is hugely complex

- instead, concentrate on intelligence displayed by experts in a ______ domain

- expert system consists of _________ base developed by knowledge engineer

- uses inference ______ to apply rules to facts, to solve a problem

- number of expert systems has increased to tens of thousands; $1+ billion industry

- most popular: finance, manufacturing control, fault diagnosis
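
The idea of applying rules to facts can be sketched with a small forward-chaining loop. This is a toy illustration under stated assumptions: the fault-diagnosis rules and fact names are invented, and real expert systems (e.g., MYCIN) are far more elaborate.

```python
# Hypothetical rules: (set of conditions, conclusion). Not from any real system.
RULES = [
    ({"engine_cranks", "no_spark"}, "ignition_fault"),
    ({"ignition_fault", "old_spark_plugs"}, "replace_spark_plugs"),
    ({"engine_silent", "lights_dim"}, "battery_flat"),
]


def forward_chain(facts, rules):
    """Keep applying rules until no new conclusions can be added."""
    known = set(facts)
    changed = True
    while changed:
        changed = False
        for conditions, conclusion in rules:
            # A rule fires when all of its conditions are already known.
            if conditions <= known and conclusion not in known:
                known.add(conclusion)
                changed = True
    return known


observed = {"engine_cranks", "no_spark", "old_spark_plugs"}
derived = forward_chain(observed, RULES)
print(sorted(derived - observed))  # ['ignition_fault', 'replace_spark_plugs']
```

The second rule fires only because the first one added a new fact, which is how chains of reasoning emerge from simple if-then rules.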

Shortliffe & colleagues (1973): MYCIN

- asks questions like, “Has the patient recently suffered burns?” or “Does the patient have a known allergy to Colistin?”

- knowledge of bacterial infections represented as ~450 _____

- performed as well as _______ (better than med students and residents) at Stanford Medical School (Yu et al., 1979)

Pros & Cons:

highly specialized and accurate

difficult to _________ knowledge of experts

no intuition; could not _____ from mistakes

limited to domain of expertise (“weak” or “narrow” AI)

AI & Common Sense

- scripts (Schank & Abelson, 1977) and frames (Minsky, 1975) developed to aid machine translation

- explicitly represented background knowledge

- give a frame of reference

Lenat (1984, 2017): Cyc (short for “____________”)

- explicitly represents knowledge not found in an encyclopedia

- “common sense knowledge base” of over 24 million hand-entered rules; divided into “_____________”

e.g., “On January 2, Abraham Lincoln was in Washington” implies that:

• Lincoln’s left ___ was probably in Washington, too

• his parents remained his parents for life

• he was in Washington for the whole day

- pros & cons:

successful applications include Terrorism Knowledge Base, natural-language database of medical information, and financial analysis

still a work in progress (cannot separate fact from fiction; hand-entering knowledge is slow and tedious; expensive to develop)

Artificial Neural Networks

all of the above are GOFAI (“Good Old Fashioned Artificial Intelligence”), or classical symbolic AI based on human-coded programming

- the problem with AI is that it doesn’t have a _____

- solution: give it a brain!

- artificial neural networks (ANNs) are inspired by the function of neurons

- a.k.a. PDP approach, or connectionism: emergent properties that arise from interconnected networks of processing units

- instead of being programmed with explicit rules, ANNs apply _______ ________: letting computers develop algorithms iteratively from data

Perceptron:

- early attempt at ANN by Frank Rosenblatt (1958)

- connections could be modified, but network was simple and computationally _______
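
A minimal sketch of a single perceptron whose connection weights are modified by the classic error-correction rule. The training task (logical AND), the learning rate, and the number of epochs are assumptions chosen for illustration; this is not Rosenblatt's original hardware or code.

```python
def predict(weights, bias, inputs):
    """Output 1 if the weighted sum of inputs exceeds the threshold, else 0."""
    total = sum(w * x for w, x in zip(weights, inputs)) + bias
    return 1 if total > 0 else 0


def train_perceptron(samples, epochs=20, learning_rate=1):
    """Nudge weights toward the target whenever the output is wrong."""
    weights = [0] * len(samples[0][0])
    bias = 0
    for _ in range(epochs):
        for inputs, target in samples:
            error = target - predict(weights, bias, inputs)
            weights = [w + learning_rate * error * x for w, x in zip(weights, inputs)]
            bias += learning_rate * error
    return weights, bias


# Logical AND: output is 1 only when both inputs are 1.
AND_SAMPLES = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
weights, bias = train_perceptron(AND_SAMPLES)
print([predict(weights, bias, x) for x, _ in AND_SAMPLES])  # [0, 0, 0, 1]
```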

New Connectionism:

- algorithms were developed to allow ANNs to self-modify connections in the mid-1980s

- result: ANNs with more sophisticated neurons used feedback to ______ connection weights (supervised learning)

- were able to _____ from experience

- had greater computational power; could solve more complex problems

- however, the computational limitations of this approach eventually became apparent, leading to a second AI winter in the 1990s

Deep Learning:

- new approach to ANNs inspired by the structure and organization of the ________ ______, starting in the mid-2000s, leading to an “AI spring” (e.g., Krizhevsky et al., 2012)

- contributing factors:

• access to huge, labeled datasets

• availability of enormous computing power in graphics-processing units originally designed for video games

• multiple ______ of neurons, allowing representation of more abstract concepts

e.g., pixels → edges → features → faces

- examples:

• Google Brain (Le et al., 2012):

▸ has 16,000 processors with 1 billion connections

▸ watched 10 million YouTube videos and by itself was able to identify what a ___ was despite being fed no information on distinguishing features that might help identify it

• Microsoft’s Project Adam (Chilimbi et al., 2014):

▸ has 2 billion connections, requires 30× fewer processors, but is claimed to be twice as accurate as competitors

▸ processed ImageNet database, which has 14 million images organized into 22,000 categories

▸ result: can identify dogs in images, even whether a _____ is a Pembroke or a Cardigan

Real-World Applications:

• Autonomous vehicle (e.g., Waymo Driver)

- navigates using data from GPS, camera, LIDAR (laser imaging, detection, and ranging), and multiple radar sensors

- ANN also “______” by human driver

- has driven over 32 million km by itself

- autonomous technologies increasingly available in consumer vehicles (e.g., automatic parking, collision avoidance systems, autonomous cruise control, etc.)

- limitations:

cannot drive on an area not yet ______

cannot detect lane markings in wet/snowy conditions

• Large language model chatbots: OpenAI’s ChatGPT

- ANN based on statistical relationships of words in its training data

- GPT = generative pre-trained transformer

▸ generative: can simulate conversations, create computer code, and write prose and poetry

▸ pre-trained: trained on a corpus of text containing 300 billion words taken from web pages, programming language documentation, books, YouTube videos, and other sources

▸ transformer: deep-learning architecture that processes sequential data (like words in a sentence)

- GPT-4 estimated to have 1.76 trillion “__________” or variables; these include weights of connections between ANN neurons

- limitations:

often gives plausible but _________ answers

subject to algorithmic bias present in the training data, including gender and racial biases
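
To give a feel for what "statistical relationships of words" means, here is a toy Python sketch that counts which word follows which in a tiny text and then predicts the most frequent follower. This is a simple bigram counter, not a transformer, and it is vastly simpler than ChatGPT; the corpus and function names are made up for illustration.

```python
from collections import Counter, defaultdict


def train_bigrams(text):
    """Count, for every word, how often each other word follows it."""
    words = text.lower().split()
    following = defaultdict(Counter)
    for current, nxt in zip(words, words[1:]):
        following[current][nxt] += 1
    return following


def most_likely_next(following, word):
    """Return the most frequent follower of `word`, or None if unseen."""
    candidates = following.get(word.lower())
    if not candidates:
        return None
    return candidates.most_common(1)[0][0]


corpus = "the cat sat on the mat and the cat slept"
model = train_bigrams(corpus)
print(most_likely_next(model, "the"))  # cat
print(most_likely_next(model, "dog"))  # None
```

The prediction reflects only what appeared in the training text, which is also why biases in that text carry over into the output.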

The Future of AI

AI Calibre (Urban, 2015)

• AI Calibre 1: Artificial ______ Intelligence, a.k.a. weak AI

- may be equal to or superior to human intelligence in a very narrow domain

- but is the same as humans only at level 1, computational theory, in Marr’s tri-level hypothesis

e.g., IBM’s Deep Blue defeated world chess champion Kasparov in 1997

• AI Calibre 2: Artificial _______ Intelligence, a.k.a. strong AI

- can perform the same tasks a human can

- is the same as humans at level 1, and at level 2, representation and algorithm as well

e.g., N/A

• AI Calibre 3: Artificial _________________

- “any intellect that greatly exceeds the cognitive performance of humans in virtually all domains of interest” (Bostrom, 2014, p.22)

- likely will be the same as humans only at the level of computational theory

e.g., N/A

AI & Consciousness

John Searle (1980): The _______ ____ argument

- imagine you are alone in a room

1. slip of paper with Chinese symbols enters room

2. you look up symbol in book

3. you write down more symbols on paper

4. you send slip out

- you do not understand Chinese

- so where is the _____________ of Chinese?

- refutes strong AI: computers just manipulate symbols; they will never have a “mind” or “consciousness” which is an emergent property

- based on two ideas:

• brains cause _____

• ______ does not suffice for semantics
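
The four steps of the thought experiment can be mimicked by a lookup table. The sketch below is only an illustration of the scenario, with a hypothetical two-entry rule book; it takes no side on whether Searle's conclusion follows.

```python
# Hypothetical rule book: incoming symbols paired with outgoing symbols.
RULE_BOOK = {
    "你好吗": "我很好",
    "你是谁": "我是人",
}


def room(slip: str) -> str:
    """Steps 1-4: receive a slip, look it up, copy out the reply, send it."""
    # The lookup compares shapes only; no meaning is consulted anywhere.
    return RULE_BOOK.get(slip, "")


print(room("你好吗"))  # 我很好
```

From outside the room the reply looks fluent, yet the function would work identically if the strings were arbitrary squiggles.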

Ray Kurzweil (1999): The Age of Spiritual Machines: When Computers Exceed Human Intelligence

- computers are doubling in power about every 2 years

- predicted they should have computational capacity comparable to human brain by ____

- defines consciousness as:

• the ability to have subjective experience

• the ability of a being, animal or entity to have self-perception and self-_________

• the ability to feel

- predicted a computer will declare, “I think, therefore I am.” before ____

- in 2005, predicted a “technological ___________,” when artificial superintelligence emerges, in 2045
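
The doubling claim above compounds quickly. A short sketch of the arithmetic, assuming a constant doubling period of exactly 2 years (the function name `growth_factor` is hypothetical):

```python
def growth_factor(years: float, doubling_period: float = 2.0) -> float:
    """How many times more powerful after `years` of steady doubling."""
    return 2 ** (years / doubling_period)


print(growth_factor(10))           # 32.0
print(growth_factor(20))           # 1024.0
# From the book's publication (1999) to the predicted date of 2045:
print(growth_factor(2045 - 1999))  # 8388608.0
```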

“The question of whether machines can think...is about as relevant as the question of whether submarines can swim.” -- Edsger W. Dijkstra, 1984