tv American Artifacts CSPAN November 17, 2013 8:00am-8:32am EST
8:00 am
like other newspapers, it opened an avenue for people to lead discussions in other communities and upheld the liberty party and notified people where there would be meetings. it was the way to reach people, the kit scitizens who needed an wanted to be involved. throughout the weekend, american history tv is featuring michigan. learn more about ann arbor online. you are watching american history tv on c-span3.
8:01 am
each week american artifacts visits locations across the country. the archive offers permanent access for scholars, historians and the general public. >> the idea is to try to build the library version two. can you go and make all of the books, music, video available to all those that are interested enough. we are trying to do the pieces that are missing. maybe the historical pieces or the large scale hosting materials or taking the materials of people that people are puting up on their websites every day. the archive is large servers but
8:02 am
also physical holdings of books, music, video, and offering them for free public access anywhere in the world. the digital materials are on servers that are here in a church that we have bought in san francisco. we have converted a church to be a library. i love looking at them. they have lights on them. every time a light blinks it is someone uploading or downloading something. we get ten to fifteen million books downloaded a month. concert recordings are popular. we have the primary servers are here and the back up servers are
8:03 am
in california and in redwood city and a partial copy in egypt and in amsterdam. the idea is to have copies in other places so that we don't have the fate of burning and disappearing. it has been my idea to try to take advantage of this change that we can actually store things online and make them broadly available. it was always in the air. the promise of having the library of congress on your desk forever. it has been a long journey and then building some of the computers and establishing systems that became the worldwide web. the early days trying to get
8:04 am
this open structure to work and building the 1996 archive. and now, we digitizing 1,000 books a day and making them publicly available. >> i'm the coordinator here at the san francisco scanning center for internet archive. we have 33 scanning locations and this is one of them in which we were working on digitizing books, microfilm and also regular film. we have three main steps and we have our second step which is the image capture step. tina is working on our second step. this is the machine that we designed.
8:05 am
it is a way to digitize books that is non destructive that provided a high quality image once the book is online. we have two professional grade cameras one in the upper right and upper left. what tina will do is go through each page and take an image. we have two pains of glass on top and what that does is, it presses the page so that it is flat and that will provide a nice clear, image once it is online. it is a versatile set up we can scan all types of books and we want to be prepared for that. after the image capture step we have the third and final step.
8:06 am
we call this the republishing step we want to ensure that every page is present and make sure the images are clear and the content has been captured. what she is doing now is performing the kquality check ad setting the book up now to be used online. all of the content inside that crop box will display online. and once karina verifies that the quality is correct and the presentation is correct she will upload the book. once you look at the item on our website, we have an inbrowser
8:07 am
read reader. you are able to page through and search the text and view it in a number of ways. there are a number of file for mats that you can download as well. a pdf, a kindle file and an e feel. combined with all of the steps it takes roughly an hour. we have over three million books online. we put the images online and put it in lots of different formats. we make it available on the kindle the nook and we have to keep converting these materials to new formats. the one i'm excited about is the
8:08 am
talking book that is made available to the blind of dyslexic. they can go to the library of congress and get a qualification. we have digitized everything and those are available for free. there are 500,000 books that are available for those. if they have to buy a commercial device that will speak to them, it talks a little bit like this. but it is a format that we can do this to. investigation before the president's commission on the assassination of president kennedy pursuant to executive order 11130. to report upon the facts
8:09 am
relating to the assassination of the president and the death of the man charged with the assassination. >> openlibrary.org is the website to go and find these books. also books to borrow. there are 250,000 books only one person at a time it is very popular among people doing research. of ten we don't have the newest books, but we have something on every subject. actually the whole internet archive is a bargain. it is $12 million a year. there are 150 people that work for the archive but the reason
8:10 am
why it can be that inexpensive is we are doing a tiny part of the work. there are volunteers all over the world that are building the collections. we are hosting them. it is like going and saying that the publisher superintendant e. the cultural materials are being built. >> half of the money from the archive comes from the money to pay us books and and collect archives. we have room in the library of congress itself where we are digitizing books all day long. half of the money comes from foundations of private individuals. the way back machine is a way to
8:11 am
get access to the web headi iho. we started having a computer contact every website and click on every link on it and download the pages and images to reproduce that website after the fact. we do this for all of them every two months. we take a snapshot and it is starting to get big. we collect a billion pages each week. the total number of pages in the collection is 283 billion. that is just the web data. so it is five peta bytes of material. that is made available to people sometimes in bulk but most people go to the way back
8:12 am
machine which is named after the rockie and bull winkle show. and it makes it so that you can type in the url and see the web as it was. out of print webpages. it gets 600,000 people a day using it. it gets 1,000 to 2,000 a second. it is serve aing a roll of maki the web reliable. so in this digital world we have to be proactive to go and collect the materials.
8:13 am
we started working with the smithsonian institution. they put a display in their memorabilia room. he said maybe it is going to be like the bumper sticker. that was the first collection that we did. we worked closely with the library of congress. our next one was working to archive the year 2000 websites. we worked closely with the library of congress. they have been great partners. we give them back the data and they store it forever. they have been instrumental in making it so that the web is a living document through the internet archive and we then
8:14 am
help them by building their collections. they work with the national archives every two years to archive the end of term government archives. and that is available for the nara.gov website. but it is when you go and use that, it is using our servers. they keep an off line copy. but we are actually the ones that host those materials. the different industries are set up to protect their existing interests. we are finding that every field needs to be addressed differently. we do that. it is an opt-out type system. with books. we digitize the public domain books and make them available. with the television collection
8:15 am
which is a mfantastic new research you get to search based on closed captions for free but you only get 30 seconds back. if you want more, then you borrow the whole program then we put it on a dvd and you send it back. if we deal with them respectively and as a library, non profit, it seems to all around work. we have over a million different moving images to have available and remix. there is lots of things that people have uploaded. there is one called fedflix which is a collection that comes from the u.s. government. and he has got ten the funding.
8:16 am
publ publicresource.org has gotten the old types from the archives and made them available for pfre public reuse. all the outputs of the united states government are public domain. you should be able to then get to it >> our government is the care taker for these stores lie fallow today but they could become a platform that provides access to all. prior efforts have been half hearted. we should be spending minimum of $250 million a year a decade.
8:17 am
the smithsonian, the national archives, the library of congress and national library of medicine must work together to make congress the foundations and the public all clamor to help them create this platform. >> it is difficult to get to. it is like building a national park but then making it hard for people to get to it. and he has taken it as a multi decade project. sometimes he has to buy them to get access to them and make them publicly available. he is one of my heros. it used to cost quite a bit to get that. you see what is there and done
8:18 am
download that and do whatever you want. >> i'm rick pralinger i'm founder of the archives of film located in san francisco. both of the archives collections have large collections of federal government produced films. but the federal government historically is the world's biggest media maker. and their films run the gamut from industrial training films about aircraft riveting to policy releated films that might be a recording of a speech to great porks warks works of art. years of lightening.
8:19 am
day of drums. the life of jfk. but there is a lot of every day government films that are not so well-known. one that i like a lot is "tuesday in november" x. the score is by virgil thompson. it was made in 1945 by the office of information to be seen oversees and it is simple language. this is an american city. it's name is riverton. it is not too big nor too small. it is early morning of the first tuesday in november. this is an american city. a city that is not very large, not very rich, not very old. >> and it is made to be translated into many languages.
8:20 am
the idea of the film is that we are such a strong democratic country that we can hold a presidential election in the middle of a war, 1944 and the country will survive. nd it dramatizes how the people are invested. it is certainly a film that people should see today. i think we tend to think that it has always been this way. people that feel anti-government, whatever their rationale is for those feelings, i think it is interesting for them to look at a period when people were more engaged and try to understand why. the federal government films, i likes ones with smokey bear and mr. zip but the adventures about
8:21 am
junior raindrop about erosion. >> out tumble millions of earth bound rain drops. >> ouch. fine welcome this is. >> you know, little run away rain drops love forests where too many trees have been cut down. >> people always love the military etiquette films. >> in addition to cleanliness good grooming means good taste. to be well groomed in the back your hair should neither be too long nor too short but an appropriate length in the cap. it should not extend below the bottom edge of the collar.
8:22 am
this obviously won't do and neither will this. you are in the woman's army not the men's and should always trive to look feminine. >> the american nuclear testing program was heavily, heavily documented and there was the scientific photography which was unedited and quite of ten those were used in other films. and those were oriented to promote a particular program. i think the government glfilms that have meant the most to a lot of people are the nasa films. so much of our sense of space and exploration and pioneering has been created by foottage that was shot by employees in
8:23 am
space. nasa has made wonderful films over the years. tower clear. ♪ ♪ >> apollo 11, they and millions and others reached out and in their own way touched another planet. >> who is going to control history? who is going to control the cultural heritage? is it going to be a corporate project or is it going to be a public project with a
8:24 am
participation with a group? we have examples of this in the past. we have a public library system that has done a tremendous job at being free to all and we need to make sure that in the digital age that this continues. while this involves new technology, it is traditional how can we keep the threads of knowledge going and make sure they are accessible to the public? >> we hope this becomes a central library and a hod dell f model to digitize their holdings. wouldn't it be great to have everything at harvard and princeton and yale online? and if you have enough interest that you can get ahold of it without having to go and be admitted to one of these great libraries and institutions that
8:25 am
is what we are looking for. we are looking for the wikipedia generation to have access to it but to have computers have access to it so you can have new patterns and that type of thing you need to be able to do searches. that kind of computer engagement with these materials is now possible. we are trying to make it so that people can dive in. whether they are finding new patterns railroad a s or findin generation of google. i'm a geek. i went to mit and studied with
8:26 am
artificial intelligence. we thought about building a global brain and we thought at least it better that read all of the good books. where are we? >> first you have to scope the problem how big is it? if you want to put all of the published works online how big a problem is it? we don't know. the largest print library in the world is the library of congress. 26 million volumes. and a book if you had a book, is about a megabyte. so 26 million megabytes is 26 terabytes. and it fits in a system about this big and it costs about
8:27 am
$60,000 so for the cost of a house or around here a garage you can have spinning all of the words of congress. >> we can continue to work on this for decades and it will never be a boring, dull day. the internet is a fun vibrant environment to be a part of. 100 people work in the scanning centers scanning books mostly and there are 40 of us that work here in san francisco that are programers, administrators, lie brians and we are the ones that are bringing the collections together. >> we would like to say that we bought this building because it matches the logo. we have columns on our logo but this really was a building that
8:28 am
was available. san francisco is a city that is hard to find spis. this space seemed perfect and we were able to come in and be under one roof. this was the place of worship for the christian scientists. every time you see a light blinking that is data coming in or out. these are our te rx racotta soldiers when someone has worked with the archives for three years, bruce ter commissions a statue. there is one of these for anybody who has been with the organization for three years. i'm in the second row on the left from where you are there. really the hower of tpower of t
8:29 am
is the number of people out there that are doing wonderful projects. we are, archiving those and putting them out through the wayback machine. we are a small hub but the real work is going on by thousands and thousands of enthusiasts they make sure that they want their grand fathers works still around and they want these particular passions. how can we go and take the work of these people and make sure that it endures? >> you can explore the digital
8:30 am
materials at archive.org. links on the main page take you to the collections. you can find publications created by the united states government. ♪ ♪ >> our story begins with a young lady named margaret oliver. you out to meet her because she is working for four out of every five americans. you see she is your representative at the social security administration. her job is to tell you what your rights are. this november 22nd, marks the 50th anniversary of president kennedy's assassination.
8:31 am
we will look back on the president's policies. he stood before the united nation's general assembly for what would be his last address to that body. he noted that the delegates with meeting in an atmosphere of rising home. he spoke of the space race evolving into joint exploration with the soviet union. mr. president, as one who has taken some interest in the election of presidents i want to congratulation you on your election to this high office. mr. secretary general, delegates, ladies and gentlemen
178 Views
1 Favorite
IN COLLECTIONS
CSPAN3Uploaded by TV Archive on
