ED. NOTE: In May, 2004, Edge published Danny Hillis's essay in which he proposed Aristotle: The Knowledge Web. "With the knowledge web," he wrote, "humanity's accumulated store of information will become more accessible, more manageable, and more useful. Anyone who wants to learn will be able to find the best and the most meaningful explanations of what they want to know. Anyone with something to teach will have a way to reach those who what to learn. Teachers will move beyond their present role as dispensers of information and become guides, mentors, facilitators, and authors. The knowledge web will make us all smarter. The knowledge web is an idea whose time has come." Last week, Hillis announced a new company called Metaweb, and the free database, Freebase.com. The launch was covered by John Markoff in his New York Times article "Start-Up Aims for Database to Automate Web Searching" [3.9.07]. Below is an addendum to his essay that he has written for Edge as well as a link to the original essay and subsequent Edge Reality Club discussion. ADDENDUM
TO "ARISTOTLE: THE KNOWLEDGE WEB"
In the spring of 2000, while I was writing the Aristotle essay, Jimmy Wales and Larry Sanger began taking a much more practical approach to a similar problem. Their project, called Nupedia, was an attempt to create a carefully edited encyclopedia of the world's knowledge that would be available to anyone, for free. The Nupedia project made some progress, but the going was slow. About a year later, they put a wiki on the web, allowing anyone to contribute feed material to Nupedia. That feeder project was called "Wikipedia". What happened next is a piece of History that should make us turn-of-the-millennium humans all feel proud. While all this was going on, I continued to plug away, trying to build a prototype of the more structured, computer-mediated knowledge base that is described in the essay. Comments from my friends, including those posted on Edge, helped me realize that trying to build a tutor and a knowledge base at the same time would be biting off way too much. So, I decided to concentrate on the "Knowledge Web" part of the problem. Even that seemed to be an uphill battle, because the collapse of the dot-com boom dimmed the funding prospects for all things connected. It was not a good time for ambitious ideas. Or maybe it was a good time. It was during this period, undistracted by the frenzy of a boom, that Google and Wikipedia were able to build spectacularly ambitious tools that made us all smarter. Eventually, their success reminded everyone that the information revolution was just beginning. There was renewed enthusiasm for ambitious dreams. This time, I was better prepared for it, having met, during the interim, a great product designer (Robert Cook) and a great engineer (John Giannandrea), both of whom shared the dream of building a connected database of human knowledge that could be presented by computers, to humans. In retrospect the key idea in the "Aristotle" essay was this: if humans could contribute their knowledge to a database that could be read by computers, then the computers could present that knowledge to humans in the time, place and format that would be most useful to them. The missing link to make the idea work was a universal database containing all human knowledge, represented in a form that could be accessed, filtered and interpreted by computers. One might reasonably ask: Why isn't that database the Wikipedia or even the World Wide Web? The answer is that these depositories of knowledge are designed to be read directly by humans, not interpreted by computers. They confound the presentation of information with the information itself. The crucial difference of the knowledge web is that the information is represented in the database, while the presentation is generated dynamically. Like Neal Stephenson's storybook, the information is filtered, selected and presented according to the specific needs of the viewer. John, Robert and I started a project, then a company, to build that computer-readable database. How successful we will be is yet to be determined, but we are really trying to build it: a universal database for representing any knowledge that anyone is willing to share. We call the company Metaweb, and the free database, Freebase.com. Of course it has none of the artificial intelligence described in the essay, but it is a database in which each topic is connected to other topics by links that describe their relationship. It is built so that computers can navigate and present it to humans. Still very primitive, a far cry from Neal Stephenson's magical storybook, it is a step, I hope, in the right direction. "ARISTOTLE: THE KNOWLEDGE WEB" By W. Daniel Hillis [5.6.04] W. DANIEL (Danny) HILLIS is currently Chairman and Chief Technology Officer of Applied Minds, Inc., is best known for his innovative work in the design and implementation of the massively parallel supercomputer. Applied Minds is a research and development company creating a range of new products and services in software, entertainment, electronics, biotechnology and mechanical design. He is the author of The Pattern On The Stone: The Simple Ideas That Make Computers Work.
SAN FRANCISCO,
March 8 — A new company founded by a longtime technologist
is setting out to create a vast public database intended to be read
by computers rather than people, paving the way for a more automated
Internet in which machines will routinely share information. He says
his latest effort, to be announced Friday, will help develop a realm
frequently described as the "semantic Web" — a set
of services that will give rise to software agents that automate
many functions now performed manually in front of a Web browser. In contrast, Mr. Hillis envisions a centralized repository that is more like a digital almanac. The new system can be extended freely by those wishing to share their information widely. ... ... In its ambitions, Freebase has some similarities to Google — which has asserted that its mission is to organize the world's information and make it universally accessible and useful. But its approach sets it apart. "As wonderful as Google is, there is still much to do," said Esther Dyson, a computer and Internet industry analyst and investor at EDventure, based in New York. Most search engines are about algorithms and statistics without structure, while databases have been solely about structure until now, she said. "In the middle there is something that represents things as they are," she said. "Something that captures the relationships between things." That addition has long been a vision of researchers in artificial intelligence. The Freebase system will offer a set of controls that will allow both programmers and Web designers to extract information easily from the system. "It's like a system for building the synapses for the global brain," said Tim O'Reilly, chief executive of O'Reilly Media, a technology publishing firm based in Sebastopol, Calif. ... |
|
With the knowledge web, humanity's accumulated store of information will become more accessible, more manageable, and more useful. Anyone who wants to learn will be able to find the best and the most meaningful explanations of what they want to know. Anyone with something to teach will have a way to reach those who what to learn. Teachers will move beyond their present role as dispensers of information and become guides, mentors, facilitators, and authors. The knowledge web will make us all smarter. The knowledge web is an idea whose time has come. "ARISTOTLE"
(THE KNOWLEDGE WEB) [5.6.04] THE REALITY CLUB: Responses by Douglas Rushkoff, Marc D. Hauser, Stewart Brand, Jim O'Donnell, Jaron Lanier, Bruce Sterling, Roger Schank, George Dyson, Howard Gardner, Seymour Papert, Freeman Dyson, Esther Dyson, Kai Krause, Pamela McCorduck Introduction Part of Danny Hillis's charm is his childlike curiosity and demeanor. The first time we talked was on the telephone one Sunday morning in 1988 when he was at his home in Cambridge. We got into a serious discussion about the relationship of physics to computation. "This is interesting," he said. "I'd like to come to New York and continue the conversation face-to-face." Three hours later, my doorbell rang, and there stood a young man, looking like a clean-cut hippie. He had long hair, wore a plain white T-shirt and jeans, and carried nothing. He lived up to his reputation as the "boy wonder". We talked for hours. Danny's energies at that time were concentrated on getting processors to work together so that computation takes place with communicating processors, as happens with the Internet. The Net's potential to become an organism of intelligent agents interacting with each other, with an intelligence of its own that goes beyond the intelligence of the individual agents fired Danny up. "In a sense," he said, "the Net can become smarter than any of the individual people on the Net or sites on the Net. Parallel processing is the way that kind of emergent phenomenon can happen. The Net right now is only a glimmer of that." Danny described the Internet of that time simply as a huge document that is stored in a lot of different places and that can be modified by many people at once, but essentially a document in the old sense of the word. In principle, the Internet could be done on paper, but the logistics are much better handled with the computer. "I am interested in the step beyond that," he says, "where what is going on is not just a passive document, but an active computation, where people are using the Net to think of new things that they couldn't think of as individuals, where the Net thinks of new things that the individuals on the Net couldn't think of." "In the long run, the Internet will arrive at a much richer infrastructure, in which ideas can potentially evolve outside of human minds. You can imagine something happening on the Internet along evolutionary lines, as in the simulations I run on my parallel computers. It already happens in trivial ways, with viruses, but that's just the beginning. I can imagine nontrivial forms of organization evolving on the Internet. Ideas could evolve on the Internet that are much too complicated to hold in any human mind." The passages quoted above are from my book Digerati, published in 1995, a time when the ideas set forth by Danny were technologically implausible. In 2000, still on the same track, Danny wrote the prescient paper Aristotle, in which he proposes "The Knowledge Web", again at a time when the technological possibilities did not equal the vision. But now in 2004, Danny is a grown-up wonder, and we are in the age of Google, of boy wonders Sergey Brin and Larry Page, and the rapidly expanding potential of the Internet as a knowledge web. In
this regard, thanks to funding from the Markle Foundation,
Danny been able to assemble a group of people to begin
to discuss of the implementation of a medical application
based on his ideas. Other possibilities for applications
are open-ended. What has changed in the past nine years
is that implementation of his ideas are now technologically
feasible. —JB W.
DANIEL (Danny) HILLIS is currently Chairman and Chief
Technology Officer of Applied Minds, Inc., is best known
for his innovative work in the design and implementation
of the massively parallel supercomputer. Applied Minds
is a research and development company creating a range
of new products and services in software, entertainment,
electronics, biotechnology and mechanical design. He is
the author of The Pattern On The Stone: The Simple
Ideas That Make Computers Work. "ARISTOTLE" (THE KNOWLEDGE WEB
(DANNY HILLIS:) I have always envied Alexander the Great,
because he had Aristotle as a personal tutor. In those
days,
Aristotle knew pretty much everything there was to know.
Even
better, Aristotle understood the mind of Alexander. He understood
which topics interested Alexander, what Alexander knew
and
did not know, and what kinds of explanations Alexander preferred.
Aristotle had been a student of Plato, and he was himself
a great teacher. We know from his writings that he was
full
of examples, explanations, arguments, and stories. Through
Aristotle, Alexander had the knowledge of the world at
his
command. This problem isn't new. In 1945, Vannevar Bush wrote an essay for Atlantic Monthly about out the problem of too much knowledge. He wrote,
Bush's imagined solution to this problem was something he called Memex. Memex was envisioned as a system for manipulating and annotating microfilm (computers were just then being invented). The system would contain a vast library of scholarly text that could be indexed by associations and personalized to the user Although Memex was never built, the World Wide Web, which burst onto the scene half a century later, is a rough approximation of it. As useful as the Web is, it still falls far short of Alexander's tutor or even Bush's Memex. For one thing, the Web knows very little about you (except maybe your credit card number). It has no model of how you learn, or what you do and do not know—or, for that matter, what it does and does not know. The information in the Web is disorganized, inconsistent, and often incorrect. Yet for all its faults, the Web is good enough to give us a hint of what is possible. Its most important attribute is that it is accessible, not only to those who would like to refer to it but also to those who would like to extend it. As any member of the computer generation will explain to you, it is changing the way we learn. A
New Tool for Learning For example, one topic in the knowledge web might be Kepler's third law (that the square of a planet's orbital period is proportional to the cube of its distance from the sun). This concept would be connected to examples and demonstrations of the law, experiments showing that it is true, graphical and mathematical descriptions, stories about the history of its discovery, and explanations of the law in terms of other concepts. For instance, there might be a mathematical explanation of the law in terms of angular momentum, using calculus. Such an explanation might be perfect for a calculus-loving student who is familiar with angular momentum. Another student might prefer a picture or an interactive simulation. The database would contain information, presumably learned from experience, about which explanations would work well for which student. It would contain representations of many successful paths to understanding Kepler's law. Given such a database, it is well within the range of current technology to write a program that acts as a tutor by selecting and presenting the appropriate explanations from the database. The automated tutor would not need to create the explanations themselves— human teachers would create the explanations in the knowledge web, and the paths that connect them. The program would merely find the appropriate paths between what a student already knows and what he or she needs to learn. Along the way, the automatic tutor would quiz the student and respond to questions, much as a human tutor does. In the process, it would improve both its model of the student and the information in the database about the success of the explanations. For example, imagine yourself in the position of an engineer who is designing a critical component and wants to learn something about fault-tolerant design. This is a fairly specialized topic, and most engineers are not familiar with it; a standard engineering education treats the topic superficially, if at all. Fault-tolerant design is an area normally left to specialists. Unless you happen to have taken a specialized course, you are faced with a few unsatisfactory alternatives. You can call in a specialist as a consultant, but if you don't know much about the field it's difficult to know what kind of specialist you need, or if the time and expense are worth the trouble. You could try reading a textbook on fault-tolerant design, but such a text would probably assume a knowledge you may have forgotten or may never have known. Besides, a textbook is likely to be out of date, so you will also have to find the relevant journals to read about recent developments. If you find them, they will almost certainly be written for specialists and will be difficult for you to read and understand. Given these unsatisfactory choices, you will probably just give up. You will go ahead and design the module without benefit of the proper knowledge, and hope for the best. Learning
with Aristotle Aristotle would begin by asking you how much time you're willing to devote to this project and the level of detail you want. Then Aristotle would show you a map of what you need to learn. The tutor program does this by comparing what you know to what needs to be known to design fault-tolerant modules. It knows what needs to be known because this is a common problem faced by many engineers, and knowledgeable teachers have identified the key concepts many times. Aristotle knows what you know because it has worked with you for a long time. There may be some things you're familiar with that Aristotle doesn't know you know, but you can point these things out to Aristotle when it shows you the learning plan. Aristotle might take your word for what you know, but it is more likely to quiz you about some of the key concepts, just to make sure. Aristotle plans its lessons by finding chains of explanations that connect the concepts you need to learn to what you already know. It chooses the explanatory paths that match your favorite style of learning, including enough side paths, interesting examples, and related curiosities to match your level of interest. Whenever possible, Aristotle follows the paths laid down by great teachers in the knowledge web. Aristotle probably also has a model of how you want to be paced: when you have learned enough for one day, when it needs to throw in an interesting side story, etc. Along the way, Aristotle will not only explain things to you but will also ask you questions—both to make you think and to verify for itself that concepts are being learned successfully. When an explanation doesn't work, Aristotle tries another approach, and of course you can always ask questions, request examples, and give Aristotle explicit feedback on how it's doing. Aristotle then uses all these forms of feedback to adjust the lesson, and in the process it learns more about you. The process of teaching helps Aristotle learn to be a better teacher. If an explanation doesn't work, and consistently raises a particular type of question, then Aristotle records this information in the knowledge web, where it can be used in planning the paths of other students. The feedback eventually makes its way back to the knowledge web's human authors, so that they can use it to improve their explanations. Like any good tutor, Aristotle allows you to get sidetracked from a lesson plan and follow your interests. If you find a particular example compelling, you may want to know more about it. If some concept you have just learned allow you to appreciate an elegant explanation of some fact you already know, Aristotle may point it out. If you are close to understanding something of critical importance or something that would be of particular interest to you, Aristotle may decide to show it to you even though it is not strictly necessary as part of the lesson. Of course, as Aristotle gets to know you, it will know how much you like this sort of distraction. Once you have learned the material, and Aristotle has verified that you have learned it, the program will update its database to indicate that you have recently learned it. As you learn more and more, it will continue to connect your recently acquired knowledge to the new concepts you are learning, until you have fully integrated them. Because Aristotle knows which subjects you are and have been interested in, it can consolidate your learning by finding connections that tie these subjects together. For example, there's a short film of Dr. Richard Feynman explaining a principle of quantum mechanics called Bell's inequality. Most people have little interest in quantum mechanics and no interest at all in understanding Bell's inequality. Most quantum physicists already understand Bell's inequality and would learn little from Feynman's explanation. On the other hand, if you are a student who is just learning quantum mechanics, who has just mastered the necessary prerequisites, Feynman's explanation can be exciting, startling, and enlightening. It not only can explain something new but can also help you make sense of what you have recently learned. The trick is showing the film clip at just the right time. Aristotle can do that. I used an engineering example to describe Aristotle because most engineering knowledge is a straightforward, factual type of knowledge. Similar techniques would work for learning other subjects - history, mathematics, or the kind of technical information that might normally be conveyed in a training course or a technical manual. Of course, there are types of useful knowledge that a program like Aristotle would not be suited to: Aristotle would not be of much use in learning how to ride a bicycle or tell a joke. It would not replace hands-on experience, nor would it replace the enthusiasm and wisdom of a great teacher. What Aristotle would do is help you gain mastery of factual knowledge—exactly the kind of knowledge that is overwhelming us. How
the Knowledge Web Changes Education Teachers know that individual attention helps a child learn and they would like to give their students more of it. Even a young child has special interests, special topics that he or she would like to know more about. A good teacher learns to recognize these individual interests and tries to nurture them, but this takes a lot of time. A program like Aristotle will give teachers a tool to help children follow their passions. It will also enable teachers to evaluate a child's progress and to provide individualized instruction in areas in which the child has gaps. A computer program like Aristotle cannot replace most of what goes on in school, but it can complement what goes on there. It can free teachers from the routine job of broadcasting information and give them more time for individual attention to their students. A system like Aristotle also enables teachers by giving them a way to publish. Any good teacher knows how to teach certain topics especially well, but there are few easy ways for them to share that information effectively with others. A teacher can write a textbook, or develop a curriculum, but each of those efforts is a major undertaking. There is no simple way for a teacher to publish an isolated idea about how to explain something. If a system like Aristotle existed, then with the proper authoring tools a teacher could publish a single explanation—an effort comparable to creating a Web page. In fact, existing Web pages are a good source of initial content for the knowledge web. As Marshall McLuhan said, "The content of the new medium is the old medium." The initial content of the knowledge web will be the old curriculum materials, textbooks, and explanatory pages that are already on the World Wide Web. The existing materials already contain most of the examples, problems, illustrations, and lesson plans that knowledge web will need. As students gain access to the best explanations from the best teachers on a given subject, their own teachers will be able to take on the role of coaches and mentors. Freed from the burden of presenting the same information over and over, the teachers will be able to give greater individual attention to their students. A
Better Infrastructure for Publishing All of this raises the possibility of a different kind of economic underpinning for the knowledge web, one that is not possible on the document Web of today. The support infrastructure for payments would allow different parts of the knowledge web to operate in different ways. For instance, public funding might pay for the creation of curriculum materials for elementary school teachers and students, but specialized technical training could be offered on a fee or subscription basis. Companies could pay for encoding the knowledge necessary to train their employees and customers, consultants would be able to publish explanations as advertising for their services, and enthusiasts would offer their wisdom for free. Students could subscribe not only to particular areas of knowledge but to particular types of annotations, such as commentary or seals of approval. Schools and universities could charge for interaction with teachers and certification of the student's knowledge. The system could also become the ultimate hiring tool, since employers could map areas of knowledge that they need in prospective employees. What Makes the Knowledge
Web Different? Peer-to-Peer Teaching Vetting
and Peer Review The knowledge web addresses this problem by supporting an infrastructure for peer review and third-party certification. It supports mechanisms for the labeling, rating, and categorization of material, both by the author and by third parties. The browsing tools will allow information to be filtered, sorted, and labeled according to these annotations. In addition, user feedback tools will be built into the browsing software to help identify material that is particularly good, bad, or controversial. Linking
and Annotation
Ways to Pay Authors Much of the content on the knowledge web will probably be free, but there are a number of other economic models that can coexist with this. One of the most obvious is the paid course, in which a student pays tuition for a cluster of services, including access to teachers, curriculum materials, and interaction with other students, and some form of certification at the end. With the knowledge web, many institutions may choose to offer the curriculum material for free—as a form of advertising—and charge for the other services, especially the certification. The knowledge web would also help solve one of the online course providers' greatest problems, which is marketing. The knowledge web would help direct students to the courses that meet their need. Another model that may work well is a micropayment system, in which a student pays a fixed subscription fee for access to a wide range of information. Usage statistics would serve as a means to allocate the income among the various authors. This system has the advantage of rewarding authors for usefulness without penalizing students for use. The students' fees would be independent of the amount of use they make of the system. The ASCAP music royalty system and university payments for student access to Encyclopedia Britannica are examples of how such a system might work. Guided
Learning Table of Affordances
Achieving
Critical Mass Eventually traditional educational institutions will use this new system of learning. It will be used first by colleges and trade schools; then it will make its way into secondary and elementary education. It is important to emphasize that computers will not replace the teachers; rather, they will give teachers a new tool. Instead of spending most of their time broadcasting information to a group, teachers will have more time to help students integrate their knowledge through discussion and through individualized interaction. There are three technical components necessary for such a system to work: the tutor (browser), the authoring tool, and the knowledge web itself. This last component is the most difficult to create, but fortunately it does not have to be built all at once. Even a small part of it would be useful. Presumably the knowledge web will get its start in a few narrow areas, probably determined by available funding. For instance, it is easy to imagine a scenario in which a part of the knowledge web is initially funded by a supplier to explain the use of its products. Imagine, for example, that Cisco publishes the knowledge of how to configure and maintain its routers in this form. Another scenario is that the government sponsors the creation of the system for a specific application, such as continuing education for schoolteachers or job training for factory workers. A company might pay for the development of training programs for its workers and customers. A foundation might sponsor an initial effort as a way to have an impact on education. Summary:
An Idea Whose Time Has Come At the same time that a solution is becoming possible, the problem is reaching a crisis point: the amount of knowledge is becoming overwhelming, and the need for it is increasing. There is a widespread conviction that something radical needs to be done about education—both the education of children and the continuing education of adults. The world is becoming so complicated that schools are no longer able to teach students what they need to know, but industry is not equipped to deal with the problem either. Something needs to change. With the knowledge web, humanity's accumulated store of information will become more accessible, more manageable, and more useful. Anyone who wants to learn will be able to find the best and the most meaningful explanations of what they want to know. Anyone with something to teach will have a way to reach those who what to learn. Teachers will move beyond their present role as dispensers of information and become guides, mentors, facilitators, and authors. The knowledge web will make us all smarter. The knowledge web is an idea whose time has come. |
|
Danny simply yet fully articulates an idea he has been speaking about, in one way or or another, for the past six years. It seems strangely fitting this thinking is published just around the same time that Google has its IPO. For the more-is-more, horizontally indistinguishable landscape of the world wide web is reaching a certain point of diminishing returns (why else would Google be going public?) and a new, more dynamically structured organization for human knowledge is in order. While we may not have been able to imagine the need for such a realtime, dynamically composed tutoring matrix, it certainly addresses Bush's desire for a fully accessible knowledge bank more adequately than piling more websites into the text-searchable wasteland. The trick, of course, is achieving lift-off. My suggestion would be, rather than approaching the likes of Cisco or another corporate giant looking to train people about their industry, would be to tackle a highly critical and desperate area: a disease like cancer, AIDS, or multiple sclerosis, where armies of "untrained" but highly intelligent victims and families might like to lend their own cognitive processes to finding a cure. The Lorenzo's Oil contingent, if you will. They could prove a perfect test sample for your tutoring hypothesis, allowing them to become more versed in cancer-specific treatment methodologies than most physicians, without having to go to medical school. And it
might be the kind of cause that would generate more immediate acceptance
of the applicability of this system to the world's most pressing problems. |
|
Danny Hillis's notion of a web based tutor—a digital school marm—is interesting, like so many of his ideas. But like all forms of guided learning, part of the art form comes from a tutor who can not only teach effectively, but a tutor who deeply understands what another knows. The latter bit is non-trivial. Implicit in Danny's piece is an assumption that we are giving tests that properly extract what another knows. I think this is the general problem with most education. Teachers fail to think about the different ways in which someone does or does not know something. Let's say you give an exam and student Danny scores 100 and student Marc scores 0. You have certainly given an exam that demonstrates differences in performance, but have you truly demonstrated a difference in their competence. Perhaps Marc freezes during multiple choice tests, or is just terrible at math. Perhaps Danny has a photographic memory, regurgitates all the right answers, but hasn't a clue about the underlying principles. As everyone knows, test scores provide only one window into a person's knowledge. Different tests extract different kinds of knowledge. Different tests interact with different kinds of test takers. What makes someone vary from test to test? Some of it is there knowledg —the facts and theories at their fingertips—some of it is their general capacity for reasoning, some of it their specific capacity to reason in a particular domain, and some of it has to nothing to do with knowledge or reasoning, but merely the mood of the day, the composition of the class, and whether their boyfriend dumped them or they just ate a hot fudge sunday. But there is another constraint: knowledge that is innate, part of our hardware. Although we are only beginning to uncover this machinery, the digital school marm will need to engage it. Part of this machinery, when properly harnessed, may well facilitate learning. Conversely, teaching that is insensitive to our innate knowledge may well retard learning. Consider mathematics. Although our species alone has invented the symbol systems and operations that fill math books in calculus, geometry, and algebra, every human is born with a number sense that consists of two core mechanism: one system quantifies small numbers [up to about 3 or 4] precisely and a second quantifies large numbers approximately [subject to Weber ratios as opposed to absolute numbers]. Work with brain damaged patients, brain imaging, and primate physiology, confirms the anatomical specificity of these systems. Of course, when these mechanisms click into gear, they unconsciously generate judgments of quantity without revealing their mode of operation. Intriguingly, Gauss made the following comment concerning his own insights into mathematics: "I have had the results for a long time, but I do not know yet know how to arrive at them." Similarly, Boole noted that "It is not the essence of mathematics to be conversant with the ideas of number and quantity." Currently, educators in mathematics have no awareness of these mechanisms, and thus, typically implement teaching methods guided toward precision with large numbers. It is an open question, however, whether early math education might effectively tap this innate knowledge in order to not only speed up but enhance the acquisition of mathematical knowledge. What's
the difference between a great teacher and a good teacher? Studies
by social psychologists reveal that students make judgments about
the caliber of a professor within the first few minutes of the first
lecture of the first class, and this judgment accurately predicts
the evaluation they give the professor at the end of the course. Tell
a good joke, and you may end up being perceived as the best thing
since sliced bread. I am not sure I or anyone else has a coherent
answer to the great-good distinction [I assume everyone can articulate
what constitutes a bad teacher], but it is a topic worthy of study.
Part of teaching entails dissemination of information. But there are
many other parts: opening minds, providing tools for thinking and
reasoning, providing enjoyment about knowledge, etc. The digital school
marm is a great idea, but we may have to confine her to only a few
classes. |
|
From massively parallel processing to massively parallel knowledge certainly feels like a natural progression in Danny's work. Reading
the 2000 paper in 2004, I crave two appendices to it from Danny... |
All
our paradigms of learning fall far short of what we really do.
The paradigms still date back to the hunter-gatherer age of knowledge.
We are trained when young to be squirrels, accumulating acorns
for future use and building the skills to seek out just the acorn
we need when we want it. I'm not sure that was ever an accurate
description of how people do in fact learn, but it has been a
model of how we teach, one acorn at a time. Bad studying (the
sort of thing that happens in desperate hypercaffeinated dorm
rooms at 3 a.m.) is a parody of that acorn-management learning
system. I don't think Hillis quite imagines that Hillis-soft Corporation will release Hillis-soft Doorways Tutorware v. 3.0 and sell it for $595 (pay for upgrades to get the version with the obvious bugs removed) any time soon. Some of what we can imagine will come from database designers, some from search engine advances, and some from what I imagine to be fiendishly simple and clever open source tools that turn up on some website someday with a name as odd as "Google" or "Yahoo" once seemed. Is Amazon's new A9 search engine going to do the trick? Well, it doesn't blow my socks off yet, but it already has what Hillis's piece (more than five minutes old and so, like all good visionary work, already coming true) says the web doesn't have, the ability to annotate web pages. While we slow down to talk about this stuff, the world goes on changing around us. |
|
If computer folk are about to be flattered yet again with a rush of fresh cash (I'm thinking of that Google IPO, and the implied revitalization of Silicon Valley), even as the world at large might be falling apart, the least we can do is try to solve some of the long standing problems we were supposed to be doing something about all along. Education and the healthcare system come to mind immediately. So, yes. Let's raise our army and do education right. As Danny points out, there's a long history of utopian projects that aim to enhance learning through digital connectivity and good user interfaces. To his list I'd add a couple more of my favorite examples, Alan Kay's Squeak and Andy Van Dam's Exploratories. All the projects mentioned so far share the trait that they were built primarily by experts in order to be consumed by learners, but there have also been a variety of successful kid-driven networked learning technology experiments. My two favorites of these are Thinkquest and Whyville. One point of potential confusion ought to be addressed: Moore's Law has the unfortunate side effect of allowing so many virtual objects, or worse, proposed virtual objects, to come into existence that it is impossible to find enough names for all of them. In the case of "Knowledge Web", there are already a few namesakes. There's James ("Connections") Burke's Knowledge Web, which proposes to organize knowledge historically using loops of human-to-human relationships, and will have a spiffy VR-ish user interface. There's also a Knowledge Web that's associated with the Semantic Web research community, which is led by Tim Berners-Lee. Danny asks two big questions: 1) What technology strategy should we advocate? 2) What human strategy will not get bogged down in stupid politics? Here's my take: 1) On technology Why is Danny still carrying the burden of AI? What is gained by designing learning tools around a supposed artificial tutor? Certainly it's clear that we don't understand how a mind learns as yet. Yes, we're understanding more all the time about scattered details of the learning process, but I'm sure even the most hardcore of AIers would agree that many of our basic assumptions about learning are vulnerable to overhaul as we discover more about the brain and development in the coming decades. The only reason it's even possible to consider designing an artificial tutor is that it would be hard to isolate the effects of such a construction, so it would be hard to know if it really worked. We can't even make a robot car that can drive itself, so what makes it ok to assume that we can make a robot that can "model" a child's brain? The AI debates are old. Whether or not the ultimate project of AI makes any sense (and I believe it does not), can we at least agree that a program to model what humans know or how humans learn is not something we can plan on in the near future? While AI-ers like the idea of adding animated paper clips and "Wizards" to user interfaces as part of a march to a holy inauguration of a new life form, what happens in practice is that AI interfaces in training and education become the tools of the worst, dullest bureaucrats who seek to use automation to avoid liability and lower labor costs. Computers in the field are inevitably used to obscure feedback that could improve a process unless the user interface is designed so that it's clear that specific humans are responsible for what happens. Absolutely every function that can be provided by an AI interface can be provided by an honest interface better. Let's make a "Google" instead of an "Ask Jeeves." Let's make something with a user interface that's honest about what it can do and leave fantasies of future AI to the movie makers. The viability of an artificial tutor is not the only technology strategy question to be examined. A more subtle question is the degree to which knowledge can be represented by software at this time, in the future, or ever. If the tutor is like a verb, or an actor, knowledge representation is like the corresponding noun. Somehow "content" has to be found and presented in the proper context. Once the concern of theoreticians, automated content interpretation is no longer an academic question. Huge sums are spent, for instance, on attempts to improve software that can automatically classify internet content. Three non-glamorous examples are spam catching, obscenity blocking, and terrorist interception. The results of the first two have become part of our common experience, and the results are better than useless but short of inspirational. (Does education need to rise to the level of the inspirational to succeed?) My own web pages are blocked in most public school libraries, much to my dismay. Spam-catching filters are worth using, but in my experience are not yet good enough. Whenever I've thought they were working, it turned out I was actually missing out on messages I would have liked to have received. Automation always seems to work best when it is allowed to obscure its own results. The current crop of academic projects to add an ontology layer to the internet usually rely on the distributed volunteer efforts of large numbers of humans. Such approaches generally start with an ontology a little like the Dewey Decimal System, but different in that on the net a card catalog's categories can be more easily extended, and one can choose unlimited, or even fuzzy, partial categories for a given entry. There are also projects that hope to generate implicit ontologies by extending what Google and other search engines have done with specific words to loose collections of adjacent words. A less rigid model of association might generate a more useful ontology. A fair summary is to say that the whole business of generating context and ontology is still experimental, especially at large scales. Since money is flowing into the search business, there are a lot of people looking at these problems, and I hope the happy side effect of another Silicon Valley wealth convulsion will be accelerated progress. So, while I'm deeply skeptical of an artificial tutor, I'm somewhat hopeful about improved knowledge representation. What kind of technology strategy should follow from those conclusions? Since I'm skeptical about the prospects for automation, my guess is that the answer can be found by thinking about people and politics, which brings us to the second big question. 2) On humans Education in America is a mass of deadwood and sludge studded with gems. It's not too hard to find a magical school with devoted teachers to try out new educational technologies and generate promising results. These aren't necessarily the rich schools, either. It seems impossible, however, to preserve any of the best elements of educational technology in large scale deployment. The typical school computer is an aging business-oriented machine running useless software. It can't compete with the video game boxes kids love. Why? Part of the problem is entrenched political and financial interests. In the state of California, for instance, textbook publishers convinced the legislature to keep non-book curriculum materials out of the requirements, so that they must be conceived as a luxury. If you want to cure a case of excessive cheer, go take a look at some of the current textbooks and see how much they cost. I saw one recently in New York, a big, expensive elementary school math textbook filled with color pictures of diverse people who "love math", but displaying no love of math itself. It was also filled with errors. Look at how cheap video game boxes are and how expensive crummy required textbooks are. While medicine and defense are the first choices for those who think of the government as the worlds stupidest but most reliable customer, education comes in as a close third. There are two plans of attack, bottom-up and top-down. I don't want to dismiss top-down at all. We elite technical people know a lot of elite business and political people and maybe it's possible to start something grand. My sense is that this is what Danny hopes for, and I would do anything to help that effort, even if it included a non-catastrophic level of AI confusion. There have been a few bottom-up attempts, and I'd like to tell the story of one of them because it's instructive. Thinkquest was a project conceived by Al Weis, who used to run the supercomputer business for IBM and then built much of the core of the Internet's hardware infrastructure as it existed in the 1980s and early 1990s. The idea was to run a contest in which high school age kids from all over the world would compete to build the best curriculum entries in web format. The prizes were scholarships, along with cash awards to the winning kids' teachers and schools. There were many thousands of entries, and the winners were spectacularly good. In one case, a major corporation chose a Thinkquest entry as a central training tool to teach programming. Super high quality content was created in all subjects, in many languages. Results were judged by a net-enabled community of educators from around the world managed by the Internet Society. Special credit was given to development teams who collaborated across language barriers and time zones. A grand prize winning entry one year was created by kids from Japan, South Africa (a Zulu kid), and Poland. They used barely adequate language translation software to coordinate their efforts, but they managed. The library of entries was one of the most sought after destinations on the net during the late 1990s, even though it was not publicized. It had great international buzz in the worldwide net kid underground. So, the paradigm was to get the brightest kids to write the curriculum for the rest. Communication skills were taught along with the official topics. The contest was very cost effective since there weren't that many prizes, but there were lots of wonderful entries. Even kids who didn't win benefited hugely in gaining skills, visibility, and contacts. The problem with the Thinkquest model wasn't the cost of the prizes, but the cost of the judging and maintenance. It became harder to manage the vast amount of content and huge number of human relationships. Hard to prevent abuse, fraud, hate speech. Hard to assure fairness. The project was transferred to a larger foundation with greater resources that had a strict policy of maintaining a firewall on all its servers. Thus, while the Thinkquest library was still readable, the authors could no longer update it. (All authors were previously expected to keep their entries current, because untended digital information rapidly loses relevance and value. No one likes stale web content.) The first experiment in global curriculum, and in mass collaboration for student-created curriculum, slowly lost it's vitality and turned into a mere contest. What Thinkquest demonstrated was an almost successful model for distributing the overhead of content creation and classification to a huge number of people instead of artificial agents. It was close enough to being a success that it's worth trying to see if the design can be tweaked to completion. I wonder if it would be possible to reach an agreement within the technical community on something like Thinkquest as a foundation layer, with the possibility of automated content interpretation and creation being an option for the future. Automation is so easy to have illusions about when it comes to something like education, where it's hard to measure efficacy. In this regard I'm a skeptic that the new wave of testing in American education is doing what it claims. |
|
I don't doubt that people can learn a lot from using Google. Weblogging methods that let one labor one's way through the web in the aggregated footprints of other researchers are very handy, too. I think the promise here is impressive. However, my skepto-meter goes off when I see the same verbs used for computers that are used for human cognitive processes. I suspect that computation and cognition are profoundly different things and share the same verbs mostly through historical accident. I find this sentence unconvincing, for instance: "Aristotle knows what you know because it has worked with you for a long time."
Here's a similar elision: "Aristotle plans its lessons by finding chains of explanations that connect the concepts you need to learn to what you already know. It chooses the explanatory paths that match your favorite style of learning."
Mind you, anthills are superb at doing this, but there is no ant-planning or ant-choosing ever going on there. "Go to the ant, thou sluggard: consider her ways, and be wise." I like that advice, but ants don't get tenure. "Aristotle" has never been a student. It has no empathy. "Aristotle" doesn't "know" how it feels or what it means to get bored, or sharpen a pencil, or gaze absently out the window, or throw a spitball. "Aristotle" cannot discipline, it cannot entertain, it cannot enlighten or inspire. So that is not a teacher. I also have to wonder if this process is an "education," in the sense that the word traditionally had. There was once said to be no "royal road to geometry." The goal of this "Aristotle" is to build one, just for you. What if that worked? Then every classmate you possessed would a private, personal means of understanding geometry. Is that an "education", in the socializing, acculturating or character-building sense? The historical Aristotle poses problems as the perfect model of an ideal teacher, too. He worked for a tyrant. The historical Alexander did not deftly work out the fault-tolerant engineering of the Gordian knot. He succumbed to a fit of temper, hauled out his sword and hacked the knot in half. Then he overran half the world. If the human students of this highly deracinated, non-intelligent, non empathic, non-socializing education system all entered the same career as Alexander, then we would likely be in for a lively time. |
First let me say that I am glad Danny Hillis is getting interested in education. We will never be able to change education unless the best and brightest care to start thinking about what is wrong and what can be done. The knowledge web idea is a reasonable started but Hillis misses some key points about what is needed. When we begin to consider how computers can help in education we need to what is wrong with traditional school settings. Is school broken because students lack access to knowledge or to personal tutor? It could be improved by the construction of a dynamic knowledge base to be sure, but why would that be the key issue?
At least six things need to be changed in order to create school settings that work for those who have difficulties in traditional school settings.
Teachers would be great to have around if they were there only when you needed them. In this, Hillis is right. It is clear why traditional school settings do not operate in this fashion. One cannot expect a classroom of thirty children to each be doing their own thing, asking for the teacher's help as needed. Of course, some schools actually try to do this kind of thing, Montessori Schools for example. But, even those schools give up on this model of education when faced with curricula that must be taught and tests that must be passed. It is the curriculum and the size of the class that binds the teacher, making him or her a provider of information and judge of right answers rather than someone who is there for help as needed. Hillis' tutor is great for the adult student who really cares about physics, but most children are not trying to understand physics, they are trying to understand why someone is making them take physics.
People who talk about education have forever been mouthing aphorisms about teaching students to think for themselves. It is the Holy Grail of teaching. Everyone believes it, but very few do much about it. Robert Hutchins transformed the University of Chicago. But he didn't eliminate grades and therefore he didn't eliminate the teacher as an authority figure whose views must be followed. Hillis' tutor would do that and therefore it might actually be consulted by students. But most students are trying to figure out how to please the teacher and get a good grade. They are not like Hillis who is more motivated by truth than by grades one would assume. As long as the teacher makes a judgment about the student, many students will try to please him. It is simple human nature. Then, original thinking goes out the window. For students who feel they cannot or do not want to please the teacher, school becomes tedious and learning becomes quite difficult.
No matter how good a teacher is, they are still guided by the curriculum in which they are teaching. We must get over the idea that students who fail to learn algebra or chemistry or English literature would do so if only they had a really good tutor. We need to understand that there are other things to learn besides these subjects, and that a student who turns off to the traditional curriculum, designed in 1892 for a while different set of students and circumstances, can still lead a happy and productive life if only we would consider giving him or her a more meaningful and relevant educational experience. Schools are armored against change. But, in the last few years, a chink has appeared in that armor. It may be possible to exploit it. That chink is the internet. But it is not the knowledge contained on the internet that is the real hope here. The problem is not getting the net to contain better knowledge but getting the net to be the source for education not the supplement for education. Courses have been with us for so long that we simply accept that they have the structure, length, and characteristics that they should have and leave it at that. Web courses can be different than what is there now. They can be different for three reasons. (1) Current courses aren't very good and the new medium exposes their weak |