Human & Artificial Intelligence

 

 

Learning Outcomes

1. What is intelligence?

2. What are the origins of modern intelligence tests?

3. Explain some issues with intelligence testing.

4. What are some different views on the nature of intelligence?

5. How do factors other than intelligence contribute to success?

6. Describe the beginnings of artificial intelligence (AI). What is the Turing Test?

7. What is an expert system? How is common sense important to AI?

8. Describe artificial neural networks, and deep learning in particular

9. What are some successful real-world applications of AI?

10. What is the potential future of AI? How does AI relate to consciousness?

 


 

What is Intelligence?

 

• “______ _____”?

• “____________ as a measurable capacity must at the start be defined as the capacity to do well in an intelligence test. Intelligence is what the tests test” (Boring, 1923, p.35)

• an inferred trait, representing the abilities to learn from experience, acquire knowledge, think abstractlly, and adapt to changes

 

The First Tests

 

Francis Galton (1822-1911):

- ran a service at South Kensington Museum in London

- would check your intelligence for a fee

- tests mostly __________

- pros & cons:

☑ pioneer in the study of intelligence

☑ contributed much to statistics

☒ concluded success was due to ________

☒ started the ________ movement: improving humanity’s physical and mental composition by selective parenthood

 

Alfred Binet & Theodore Simon (1905/1916):

- developed a test to identify ____ learners

- found they performed at the level of younger children

- compared ______ age (MA) with chronological age (CA)

- components:

__________ e.g., “What does misanthrope mean?”

_____________ e.g., “Why do people sometimes borrow money?”

______ relations e.g., “What do an orange, an apple, and a pear have in common?”

 

- assumptions:

• test shows current performance differences

• purpose: identify children who need special ____

• believed special training could help slow learners catch up

• test not based on a theoretical conception of “____________”

 

Lewis Terman (1916):

- modified test for use in US (Stanford-Binet Intelligence Scales)

- based on Stern’s (1912) intelligence ________:

IQ = (MA / CA) × 100

- test fine-tuned to make mean score = 100, standard deviation = 15

- pros & cons:

☑ easy to administer and score

☒ hard to _______ children of different chronological ages

☒ could not be applied to ______

☒ test was largely language-based

 

David Wechsler (1939):

- developed tests:

• Wechsler Adult Intelligence Scale (WAIS)

• Wechsler Intelligence Scale for Children (WISC)

• Wechsler Preschool and Primary Scale of Intelligence (WPPSI)

- provide verbal, performance, and overall scores, plus IQ-equivalents

- included more _________ questions

 

Issues

 

• ________

James McKeen Cattell (1890):

- devised tests based on Galton’s work

- assessed by his student, Clark Wissler

- virtually no ___________ between intelligence and college grades

(conventional IQ tests: correlations of .40 to .60)

 

• tests used extensively during WWI

- Army Alpha: verbal test

- Army Beta: performance test, with __________ directions instead of words

- used to determine job classification and leadership potential (or, who got sent to the front lines)

 

• intelligence came to be seen as inherited _____, not index of learning performance

 

• tests __________ biased:

e.g., “I don’t sing for nobody” Single- or double-negative?

e.g., Dove Counterbalance General Intelligence Test (1971), a.k.a. the Chitling Test

 

- Is one culture less intelligent, or is the test ______?

- Some cultures don’t believe in testing or competitive _______--should they be exempt?

- Do results reveal a lack of _______ important in our society?

- Is the ______ for a low score important?

 

• most intelligence tests are ____________: depend on the number of correct answers

- possible solution: use other kinds of _____, as well

- check for physiological or psychological problems

- diagnose specific learning disability

 

_____ effect (James R. Flynn, 1987; 2000; 2007):

- unstandardized intelligence scores have been increasing over time

14-point gain from 1932 to 1978

annual gain from 1947-1972 was 0.31 points (0.36 in the 1990s)

greatest increases seen in lower grades, but not for children in grade 12 (limit?)

not all subtests increase equally, thus intelligence is not a ______ entity

- the nature of intelligence has changed from practical to conceptual, due to social change

e.g., What do dogs and rabbits have in ______?

- increases in IQ are not be due to changes in genetics--they must be _____________

influence of television, schooling, parenting, etc.

kids watching basketball games on TV could learn new moves and improve their performance to match star players; their improved performance would challenge and enrich the playing of other kids on the court

jobs increasingly required more logical reasoning and analysis, so schooling focused more on logical reasoning and analysis and more people complete high school, leading to generational changes in abstract reasoning

- this is called the social multiplier effect: a virtuous cycle of skill improvement

 


 

Other Intelligences

 

Charles Spearman (1927):

- is intelligence monolithic, or comprised of different factors?

- factor analysis: determines minimum number of dimensions necessary to explain a pattern of correlations among subtests

- correlated scores on verbal, quantitative, analytical subtests

- some evidence for a general “g ” factor of intelligence

- correlations are not perfect; there are specific factors affecting each subtest

- pros & cons:

☑ more _______ tasks (e.g., academic achievement, job performance) highly correlated with g

☒ other theorists have proposed other intellectual abilities that are uncorrelated with each other

 

Theory of Multiple Intelligences (Howard Gardner, 2008, 2011b):

- ________ for intelligence:

• isolated by brain damage

• existence of prodigies, savants

• identifiable core operations (e.g., music: melody, harmony, rhythm, etc.)

• distinctive developmental history

• evolutionary plausibility

• support from experimental psychology

• psychometric support

• encodable in a symbol system

 

- multiple intelligences:

 

__________: poets, writers, linguists

 

_______-mathematical: mathematicians, scientists, philosophers

 

_______: composers, conductors, musicians

 

spatial: architects, artists, navigators

 

______-kinesthetic: dancers, athletes, actors

 

_____________ (understanding others) and intrapersonal (understanding oneself): psychiatrists, politicians, anthropologists

 

naturalist: biologists, naturalists

 

• (maybe also existential: spiritual leaders?)

 

- how should these be used?

• descriptive, not ____________

• not in standardized test, but in real life or simulations

• for more effective ________ and assessment

(but there is virtually no evidence that matching type of instruction to “learning style” optimizes learning (Pashler et al., 2008))

 

- criticisms:

☒ lacks empirical ________; rather, many of Gardner’s intelligences correlate with g

☒ there is no test of multiple intelligences; assessment relies on subjective judgment

☒ “intelligences” are more like “_________” or “aptitudes” (what’s the difference?)

 

Triarchic Theory of Successful Intelligence (Robert Sternberg, 1985):

 

1. ____________ (analytical) intelligence

 

a) ______________: recognizing a problem, selecting a procedure to solve it, checking the results

 

b) performance components: planning, implementing the procedure

 

c) _________-acquisition components: learning how to solve a problem

 

2. ____________ (creative) intelligence

• applying your knowledge on a specific task (automated or novel tasks)

• may require creativity/divergent thinking

 

3. __________ (practical) intelligence

• ability to a) adapt to, b) shape, or c) select one’s environment

 

- measured by STAT (Sternberg Triarchic Abilities Test)

- intelligence is more a matter of using what you’ve got, not how much you’ve got

- criticisms:

☒ traditional intelligence tests correlate with income, occupational prestige, and ability to stay out of jail, which are supposedly measures of practical intelligence

 

Other Factors

 

Terman & colleagues: “Genetic Studies of Genius” (1925-1959)

- how to bridge gap between potential and achievement?

- 1,528 gifted children with IQ greater than 135 (top 1%) were followed starting in 1921

- compared most vs. least successful (school/work/ambition)

- no __ difference, but difference in __________

 

Duckworth & Seligman (2005):

- self-discipline in 8th-graders measured by self-report, parent report, teacher report, and monetary choice questionnaire

- self-discipline predicted final grades, attendance, standardized achievement test scores, selection into competitive high school program

- self-discipline was more important than IQ in contributing to final grades

 


 

Artificial Intelligence (AI): Beginnings

 

John von Neumann (1945):

- conceived of stored ________ program for controlling operations of computer hardware (vs. physical switches)

- headed team to develop machine to calculate artillery trajectories for Ballistics Research Laboratory

 

J. Presper Eckert & John W. Mauchly (1946):

- built _____: Electronic Numerical Integrator And Calculator in 1946

- first electronic large-scale general-purpose digital computer

- first to handle __________ and numeric data

 

Alan Turing (1950):

- asked, “Can machines _____?”

- developed “the imitation game,” a.k.a. the Turing test:

• judge sits in a room, with an ______ in another room

• communicates with entity via keyboard

• is the entity human or not?

- not proposed as a test of ____________

 

Newell & Simon (1956):

- created Logic Theorist program

- designed to solve mathematical proofs, play games

- later created General Problem Solver

 

Artificial intelligence”coined by John McCarthy in 1956

- programs were not “______”, but didn’t have to be

e.g., flying: birds vs. ornithopters vs. airplanes

 

Computer Vision

- Marvin Minsky, Terry Winograd attempted robotic vision

- used simplified version of the world: “_____ world”

- robots programmed to “see” and move blocks

- not very __________

e.g., tried to build tower from top down

 

AI & Language

 

- Weizenbaum (1966) wrote ELIZA, a virtual Rogerian therapist

• was merely a “bag of ______”

 

- Chamberlain & Etter (1984) created RACTER (short for “raconteur”)

• BASIC program, running on Z80 chip with 64K RAM

• RACTER “wrote” The Policeman’s Beard is Half Constructed, a collection of ______ and prose

 

- these are mere “___________”, with primitive knowledge bases; understanding language is hard

 

Machine Translation

- could work word-by-word (sort of)

- plagued by problems due to inherent _________ of language

e.g., “Mary saw the bicycle in the store window, and she wanted it.”

vs. “Mary saw the bicycle in the store window; she looked at it longingly and pressed her nose up against it.”

- a sentence in a technical journal had 1+ _______ syntactically correct interpretations

 

Criticisms

 

Moravec’s paradox (1988):

- for computers, “hard things are easy, and easy things are hard”

- things most people find difficult can easily be done by computers (e.g., playing chess or doing calculus)

- however, abilities that humans find easy or even trivial are very difficult for computers to do (e.g., object perception, understanding context in conversations)

 

Dreyfus (1972):

- wrote What computers can’t do: A critique of artificial reason

- problems due to fundamental ___________ between humans and computers:

• consciousness

• body to unitize sensory experience

• fatigue, boredom, drive

• intentionality (sense of _______)

 

Lighthill (1972):

- 20 years of research into AI had been a great ______________

 

Result: “AI ______” of decreased funding and research.

 


 

Classical AI: Expert Systems

 

- everyday, generalized intelligence is hugely complex

- instead, concentrate on intelligence displayed by experts in a ______ domain

- expert system consists of _________ base developed by knowledge engineer

- uses inference ______ to apply rules to facts, to solve a problem

- number of expert systems has increased to tens of thousands; $1+ billion industry

- most popular: finance, manufacturing control, fault diagnosis

 

Shortliffe & colleagues (1973): MYCIN

- asks questions like, “Has the patient recently suffered burns?” or “Does the patient have a known allergy to Colistin?”

- knowledge of bacterial infections represented as ~450 _____

- performed as well as _______ (better than med students and residents) at Stanford Medical School (Yu et al., 1979)

 

Pros & Cons:

☑ highly specialized and accurate

☒ difficult to _________ knowledge of experts

☒ no intuition; could not _____ from mistakes

☒ limited to domain of expertise (“weak” or “narrow” AI)

 

AI & Common Sense

 

- scripts (Schank & Abelson, 1977) and frames (Minsky, 1975) developed to aid machine translation

- explicitly represented background knowledge

- give a frame of reference

 

Lenat (1984, 2017): Cyc (short for “____________”)

- explicitly represents knowledge not found in an encyclopedia

- “common sense knowledge base” of over 24 million hand-entered rules; divided into “_____________”

e.g., “On January 2, Abraham Lincoln was in Washington” implies that:

• Lincoln’s left ___ was probably in Washington, too

• his parents remained his parents for life

• he was in Washington for the whole day

- pros & cons:

☑ successful applications include Terrorism Knowledge Base, natural-language database of medical information, and financial analysis

☒ still a work in progress (cannot separate fact from fiction; hand-entering knowledge is slow and tedious; expensive to develop)

 


 

Artificial Neural Networks

 

all of the above are GOFAI (“Good Old Fashioned Artificial Intelligence”), or classical symbolic AI based on human-coded programming

- the problem with AI is that it doesn’t have a _____

- solution: give it a brain!

- artificial neural networks (ANNs) are inspired by the function of neurons

- a.k.a. PDP approach, or connectionism: emergent properties that arise from interconnected networks of processing units

- instead of being programmed with explicit rules, ANNs apply _______ ________: letting computers develop algorithms iteratively from data

 

Perceptron:

- early attempt at ANN by Frank Rosenblatt (1958)

- connections could be modified, but network was simple and computationally _______

 

New Connectionism:

- algorithms were developed to allow ANNs to self-modify connections in the mid-1980s

- result: ANNs with more sophisticated neurons used feedback to ______ connection weights (supervised learning)

- were able to _____ from experience

- had greater computational power; could solve more complex problems

- however, the computational limitations of this approach eventually became apparent leading to a second AI winter in the 1990s

 

Deep Learning:

- new approach to ANNs inspired by the structure and organization of the ________ ______, starting in the mid-2000s leading to an “AI spring” (e.g., Krizhevsky et al., 2012)

- contributing factors:

• access to huge, labeled datasets

• availability of enormous computing power in graphics-processing units originally designed for video games

• multiple ______ of neurons, allowing representation of more abstract concepts

e.g., pixels → edges → features → faces

- examples:

• Google Brain (Le at al., 2012):

has 16,000 processors with 1 billion connections

watched 10 million YouTube videos and by itself was able to identify what a ___ was despite being fed no information on distinguishing features that might help identify it

• Microsoft’s Project Adam (Chilimbi et al., 2014):

has 2 billion connections, requires 30× fewer processors, but is claimed to be twice as accurate as competitors

processed ImageNet database, which has 14 million images organized into 22,000 categories

result: can identify dogs in images, even whether a _____ is a Pembroke or a Cardigan

 

Real-World Applications:

• Autonomous vehicle (e.g., Waymo Driver)

- navigates using data from GPS, camera, LIDAR (laser imaging, detection, and ranging), and multiple radar sensors

- ANN also “______” by human driver

- has driven over 32 million km by itself

- autonomous technologies increasingly available in consumer vehicles (e.g., automatic parking, collision avoidance systems, autonomous cruise control, etc.)

- limitations:

☒ cannot drive on an area not yet ______

☒ cannot detect lane markings in wet/snowy conditions

 

• Large language model chatbots: OpenAI’s ChatGPT

- ANN based on statistical relationships of words in its training data

- GPT = generative pre-trained transformer

generative: can simulate conversations, create computer code, and write prose and poetry

pre-trained: trained on a corpus of text containing 300 billion words taken from web pages, programming language documentation, books, YouTube videos, and other sources

transformer: deep-learning architecture that processes sequential data (like words in a sentence)

- GPT-4 estimated to have 1.76 trillion “__________” or variables; these include weights of connections between ANN neurons

- limitations:

☒ often gives plausible but _________ answers

☒ subject to algorithmic bias present in the training data, including gender and racial biases

 


 

The Future of AI

 

AI Calibre (Urban, 2015)

AI Calibre 1: Artificial ______ Intelligence, a.k.a. weak AI

- may be equal to or superior to human intelligence in a very narrow domain

- but is the same as humans only at level 1, computational theory, in Marr’s tri-level hypothesis

e.g., IBM’s Deep Blue defeated world chess champion Kasparov in 1997

 

AI Calibre 2: Artificial _______ Intelligence, a.k.a. strong AI

- can perform the same tasks a human can

- is the same as humans at level 1, and at level 2, representation and algorithm as well

e.g., N/A

 

AI Calibre 3: Artificial _________________

- “any intellect that greatly exceeds the cognitive performance of humans in virtually all domains of interest” (Bostrom, 2014, p.22)

- likely will be the same as humans only at the level of computational theory

e.g., N/A

 

AI & Consciousness

 

John Searle (1980): The _______ ____ argument

- imagine you are alone in a room

1. slip of paper with Chinese symbols enters room

2. you look up symbol in book

3. you write down more symbols on paper

4. you send slip out

- you do not understand Chinese

- so where is the _____________ of Chinese?

- refutes strong AI: computers just manipulate symbols; they will never have a “mind” or “consciousness” which is an emergent property

- based on two ideas:

• brains cause _____

• ______ does not suffice for semantics

 

Ray Kurzweil (1999): The Age of Spiritual Machines: When Computers Exceed Human Intelligence

- computers are doubling in power about every 2 years

- predicted they should have computational capacity comparable to human brain by ____

- defines consciousness as:

• the ability to have subjective experience

• the ability of a being, animal or entity to have self-perception and self-_________

• the ability to feel

- predicted a computer will declare, “I think, therefore I am.” before ____

- in 2005, predicted a “technological ___________,” when artificial superintelligence emerges, in 2045

 

“The question of whether machines can think...is about as relevant as the question of whether submarines can swim.” -- Edsger W. Dijkstra, 1984