Friday, November 20, 2015

Almost Finished 1940

I keep forgetting to post on here but I'm almost finished with the 1940 Wallulah. Only 45 pages left! I plan on coming in on Monday so hopefully I can complete it then.

Wednesday, September 9, 2015

September 9

Sorry I've neglected to post on here. I'm still working on the 1939 Wallulah. Many of the pages were crooked so I've been editing almost every page in Photoshop. The edited images are saved in drobo and the .jpg files are re-sized to the standard 1200x1670.

I think I forgot to mention that in the 1938 Wallulah there is no "page036" among the drobo .jpgs and .tifs. No actual page from the yearbook is missing but rather just this file title so consecutive pages are titled "page035" and "page037."

In 1938 there was also one set of duplicates. I didn't catch this until after I'd run the images through the Scanning Station so there is no "page 56" in the ABBYY and no "page057" in drobo (the drobo files start at "page002").

Tuesday, September 1, 2015

1938 Complete

Just an update to let you know I finished the 1938 Wallulah yesterday during my shift. All the processed files are uploaded to drobo and the spreadsheet is updated. I'm now working on 1939.

Tuesday, August 11, 2015

Last Day of Summer Work: 1937 Complete

Yesterday I finally finished the 1937 Wallulah and placed all the processed files in drobo. There were numerous indices I had to type out so it took a little longer than expected and I may have gone over the allotted internship hour limit. I will not be back to work during the remainder of the summer but will email you my proposed hours to work during the school year.

Tuesday, August 4, 2015

Progress

Yesterday I scanned and edited a collection of Rex Amos material for Audrey Daniels.

Today I finished the 1936 Wallulah and began work on 1937. I also re-scanned some pages of the 1958 Wallulah which had text cropped out.

I will try to finish 1937 in the next two days since I will have hit the 300 hour limit. However, I would like to complete it before taking a break so I'll come in Monday if necessary.

Thursday, July 30, 2015

July 30

I'm still working on the 1936 Wallulah. I found one pair of duplicates after I had already sent the files through the Scanning Station so there is no "page 8" or "page9" in the final processed file names. I also deleted the duplicates from the TIF Folder and JPG Folder in drobo.

Wednesday, July 29, 2015

1935 Finished

Today I finished the 1935 Wallulah. The file numbers of the final processed files are a bit off since I found two sets of duplicates after I had already run the files through the Scanning Station. There are no files named page30, page31, page44, or page45 which makes the total number of pages 168 rather than 172.

I also began working on the 1936 Wallulah. I used the .jpegs since the software had the same amount of difficulty recognizing text on the re-sized .tifs.

Monday, July 27, 2015

Monday July 27

Today I worked on processing the 1935 Wallulah using the re-sized .tifs. The frequency with which it recognizes characters seems to be comparable to the jpgs. I found duplicates of pages 26 and 27 (numbered 031 and 032 in drobo) which I deleted from drobo. However the files had been run through the Indexing Station before I realized the mistake. The drobo numbering begain with 002 so the ABBYY processed file numbering will now be missing pages 30 and 31.

Thursday, July 23, 2015

1934 Complete

Today I finished the 1934 Wallulah. I sent the 1935 Wallulah re-sized .tifs through the Scanning Station but can easily redo this if we'd rather continue using the .jpgs. When sending the files to a local folder it asks what format you'd like them in and I kept the preset selection of .jpg which we'd been using before. This can also be altered if I resend the images through the Scanning Station as I'm not sure if this was the right option.

I'll keep my normal schedule and won't come in tomorrow, however I will be in next Friday to make-up the for missing work on Tuesday.

Wednesday, July 22, 2015

July 22

Today I continued working on the 1934 Wallulah and it is over halfway complete. I'm working on the index pages right now which is more time consuming since the software does not recognizing much of the text. I also have to redo the original 36 pages that were lost in the midst of the ABBYY update. I'm hoping to finish 1934 in its entirety tomorrow.

Thursday, July 16, 2015

1933 Complete

Today I finished the 1933 Wallulah and updated the spreadsheet. I began processing 1934 during the remainder of my workday.

Tuesday, July 14, 2015

July 14

Today I finally finished the 1932 Wallulah. We had one glitch with the software early in the morning on a text-heavy page but it worked fine the rest of the day. Along with making progress on the 1933 Wallulah, I printed materials for Mary and scanned and printed a booklet for Alice.

Friday, July 10, 2015

July 10

Today I made up the remaining hours from the days I've missed work and continued work on the 1932 Wallulah.The ABBYY software was doing a better job of recognizing text after I got through the student section and onto student activities where the text was larger. It still seems to have trouble with rosters where the names are in small type and close together but otherwise I think it's picking up text better overall. I'm over halfway done and expect to finish 1932 on Monday.

Thursday, July 9, 2015

Major Redo

In the middle of the day in the midst of working on the 1933 Wallulah, Sara caught that I had loaded the .tif files into ABBYY for the 1932 Wallulah rather than the .jpg files. I had done the same thing with the 1933 Wallulah so I deleted all the files I'd processed and started the 1932 Wallulah all over again. It seems to be recognizing less type on the student pages with small print than it did with the .tifs so I am having to type more than I did the first time.

Wednesday, July 8, 2015

Duplicate Dilemma

Today I continued working on the 1932 Wallulah and essentially completed it except for a file name discrepancy. There are two files of the same image in both the .jpg and .tif drobo 1932 folders named "page189" and "page189b." I only processed the first of these images since they are of the exact same page in the actual book. However, I was confused as to why there are two files and which one was preferable to use for the final processing through ABBYY.

The duplicate images also messed up the numbering for the processed files as I had originally put both of them through the Scanning Station. I wasn't sure if I should rename the subsequent files since there is no "page 191" among the processed files.

I assumed we would address this issue tomorrow and preceded onto the 1933 Wallulah for the remainder of my work day.

Tuesday, July 7, 2015

Cleaning Day

In the morning we cleaned the Digital Production Lab. In addition to cleaning the keyboards, I dusted the desks and behind the computers. The rest of the day I worked on processing the 1932 Wallulah in ABBYY. The software seemed to be cooperating today as it didn't crash and was able to read most of the text. I'm not sure if this is due to the readability of the text in the yearbook itself or not. Regardless the process went much quicker and I'm halfway finished. In the afternoon I also completed a small Archives scanning project for Chris.

Monday, July 6, 2015

July 6

Today I was finally able to finish processing the 1931 Wallulah through the ABBYY software. Verification Station crashed four times while trying to process the same text-heavy page. After a couple hours of not using the program, it was finally able to process the page which had been causing so much trouble. During this interim I reviewed the 1966 Wallulah and cropped all skewed pages. However several pages had cut-off text so I am in the middle of editing the re-scanned images and replacing the originals.

Thursday, July 2, 2015

Back to Work

Today was my first day after a two-week vacation. I re-scanned a couple pages in the 1958 and 1960 Wallulahs which had been cut out of the periodical edition. The files are named "page#a" and "page#b" with the number of the page that proceeded the cut-out page so they show up in the correct order. I can re-name all the files in the folder if necessary though.

Then I resumed processing the 1931 Wallulah through ABBYY. Unfortunately, it is not recognizing any of the text on the index pages so I am having to re-type them completely which is rather time consuming. Twice today the software has shut down and stopped working. Thankfully, both glitches were hours apart so hopefully it doesn't start shutting down more frequently like before the update.

Tuesday, June 16, 2015

June 16

Today was my last day before I leave for 3 weeks to go back home. I wasn't able to fully complete 1931 like I'd hoped but I only have 60 pages left. The software isn't recognizing much of the text so typing it all in has slowed me down a bit.

I also scanned some pages that were missing or skewed from the 1957 Wallulah, which have been placed in the appropriate folders. I had forgotten to go back and look through 1964 after I added the re-scanned images yesterday so I reviewed it again and it looks to be complete.

Monday, June 15, 2015

June 15

Today I finally finished re-scanning and editing all the cut-off pages from the 1964 Wallulah and updated the spreadsheet. I resumed processing the 1931 Wallulah through ABBYY and I'm hoping to finish it tomorrow and start on 1932.

Thursday, June 11, 2015

ABBYY Resurrected

On Thursday I started by continuing to re-scan and edit pages of the 1964 Wallulah whose page numbers had been cut-off in the original scans.

Bill called ABBYY and they were finally able to fix the software. It's worked without a glitch for several hours and it may be my imagination but it even seemed to be able to read the smaller text on the student pages better.

I finished the batch that had been in the Verification Station and then went back to the scanning 1964 because I'd like to finish that as soon as possible. I'm still processing 1931 Wallulah but since the software is reading text better it should go even quicker.

I won't be in to work tomorrow since I made up my hours from missing Monday by staying late the last couple nights. Next week, I'll only be here on Monday and Tuesday since I'm flying home Wednesday and I'll be back the week of July 6.

Wednesday, June 10, 2015

June 10

Today I finished reviewing the the 1962 and 1963 Wallulah. The page numbers in the 1964 are on the inside of the binding so they're partial or missing on many of the scans. There were a considerable number of skewed pages as well. I'm currently in the process of re-scanning the 60 or so pages whose page numbers were cut off and editing them.

Tuesday, June 9, 2015

June 9

Today I continued to review the 1962 Wallulah. Several of the "Living Area" pages with pictures of students in their respective residence halls were cut off so I re-scanned them. 1962 seemed to have more skewed and cut off pages than 1961 but after I finish editing the newly scanned pages it should be complete.

Friday, June 5, 2015

June 5

Today I started by reviewing the Wallulahs again, which I will be doing next week as well unless the ABBYY software is fixed. I finished 1961 and started on 1962, however I was asked to do some scans so I wasn't able to make anymore progress.

Since the scans were of paintings in art books, I tried to use The Beast, which I haven't had a lot of experience with. The glass was a little dusty so I tried to clean it with Windex but it left streaks. I was able to get them mostly off with water.

Afterwards, I was able to get everything focused and working properly, however the right page (left camera) was always significantly lighter than the other. The pictures also looked a little dusty/blurry even though I had focused the cameras. I'm not sure if these were calibration/set-up issuse or because the glass was not entirely clean. Shanel said she had a microfiber cloth so I'll try to use that to clean the glass on Monday.

In the end, I decided to use Feynman so I could scan them at a higher resolution which worked well since it captured the detail in the paintings.

Thursday, June 4, 2015

The ABBYY Debacle

On Thursday I started work on processing the 1931 Wallulah, however an error message kept coming up from Windows and the software would close. I spoke with the vendor Harold on the phone who found the problem listed in the program "Event Viewer." However he didn't know what was wrong or how to fix it so he is passing the issue along to ABBYY, the manufacturer. He couldn't give an estimate for how long it will take to be fixed but he said he would contact us with information either through phone or respond to the original email. We can also contact him directly if we'd like an update on the issue.

Although I had to try a couple times to reach Harold, someone named Steve called the Digital Kingdom phone and said he'd received two calls from the digitization number. I wasn't sure if he was connected to the ABBYY software inquiry but he wouldn't tell me what company/organization he was with and his number was not one of the ones I had dialed. The whole interaction was very strange.

Since I wasn't able to work on processing the text on the Wallulahs, I started to look for skewed pages on drobo. I started with 1961 and have found quite a few so far but they only require minor adjustments in Photoshop. I've been replacing the old images with the edited ones. I'll be in tomorrow from 8:30-1:15 in order to make-up hours from when I missed a day the first week of work soI can continue to review the Wallulahs.

Wednesday, June 3, 2015

June 3

Today I finished the 1930 Wallulah and started on 1931. I re-processed the three pages from 1928 Wallulah and placed them back in their respective Temp folder on drobo. The Scanning Station re-named all the files "page #" so I corrected them to match their original file names.

I also noticed it had changed the format of the 1929 and 1930 Wallulah file names from the original "Page_#" format. Should this be corrected?

Tuesday, June 2, 2015

June 2

Today I continued working on the 1930 Wallulah and only have 40 pages left to process. I also scanned a library floorplan and a picture of Rogers Hall for the Bill Willingham collection. I was able to enlarge the library floorplan, however the picture of Rogers Hall remained pixelated even with editing.

Monday, June 1, 2015

June 1

Today I finally finished the 1929 Wallulah. The index pages at the end of 1929 slowed me down a bit, but I've definitely got the hang of it now, as I was able to also complete at least 1/3 of 1930 Wallulah which I will continue working on tomorrow.

Thursday, May 28, 2015

Terminating the Twenties

I'm still working on the 1929 Wallulah, however I only have 20 pages left to process so I should finish early tomorrow and then I'll start on the 1930s. Thankfully, the type has gotten less decorative as the book progresses so the software reads text much more accurately and makes the process go a lot quicker.

Wednesday, May 27, 2015

May 27

Today I continued working on processing the text on the 1929 Wallulah pages and made considerable progress. It's going a lot quicker now that I've got the hang of it and the pages aren't as text-heavy as the student pages. Hopefully, I'll be able to finish 1929 by the end of this week.

Tuesday, May 26, 2015

May 26

Today I continued working on the 1929 Wallulah and was able to make quite a bit of progress. After I completed the student name pages with lots of text, the process went much quicker.

However, I did come across a couple problems near the end of the day. I noticed one of the pages looks skewed in the Verification Station but when I opened the .jpg file on Drobo it looks straight.

Also, while searching for this file, I noticed that the file names are 1+ on ABBYY, for example 088.jpg on ABBYY is 087.jpg on Drobo. I'm not sure what caused this increase in file numbering and I'm not sure how to go about finding the problem or correcting it but I'll ask tomorrow.

Friday, May 22, 2015

Sneaking in a post

Hello it's Bronte!

Just wanted to leave an update on where I am:

Currently working on the digital exhibit for the Sackett Collection. I have all the images I want to use downloaded. There are way too many I want in the exhibit right now, but I'll try to keep it to a couple dozen or so. I tried to get some of them on IntelliJ but I don't think I did it right? I'll definitely need some help with it on Tuesday.

Have a great weekend.
Bronte

Wednesday, May 20, 2015

Working through Wallulah

Today I resumed processing the text on the 1929 Wallulah pages. I was able to clarify all the steps with Sara and make some progress through the first portion of the book. However, the verification software has a fair amount of trouble reading decorative fonts and very small type size so I'm finding I have to re-type in several of the text boxes. The student pages in particular take a little more time since each student has a list of all their activities throughout college so the type is especially small and unrecognized by ABBYY. Otherwise, I'm making progress and continue to work on this tomorrow.

Tuesday, May 19, 2015

Day Two

Today I finished putting all the Blain Diary transcriptions into individual text files and was able to scan the few pages of the diary I had missed originally.

Following this, I started on identifying the text on scanned Wallulah pages. I imported all the images from 1929 Wallulah into the ScanStation and proceeded to put them through the VerificationStation. However, when I transfer them to the VerificationStation they end up out of order. I was able to do the first 6 pages but didn't get much further. I also tried to accept documents through the IndexingStation but the pages didn't appear in the proper Output folder so I'm very confused where they ended up. I'll ask Sara about all these issues tomorrow morning when I begin work.

Monday, May 18, 2015

Beginning Blain

Today was my first day working as a summer intern in the Digital Kingdom.

I started by dividing the entire transcription of the Blain Diary into separate text documents that match up with each scanned page. We decided not to include the transcriber's notes but rather provide a link to the full document with the transcription and notes. I've completed this process up to approx. mid-September in the diary so I think I'll be able to finish this tomorrow.

I also discovered that I had originally missed scanning a few pages in the diary and this resulted in some major re-numbering of the .tif and .jpg file names. The .tif file names are now up-to-date, however I still need to revise the .jpg file names, which I should also be able to complete tomorrow.

Friday, April 17, 2015

April 17

This week:

I finished rescanning and editing the 2006 Wallulah
I finished editing the 1947 Wallulah
Sara and Bill set me up with the online exhibit program which I will start this weekend

Friday, April 10, 2015

April 10

Today:

1. Rescanned 2006 Wallulah on the book scanner. Still editing. Will probably be done on Tuesday.

2. Introduction to Sackett exhibit online/kiosk

Thursday, April 9, 2015

April 9

My last blog post was dated September 24. Apparently, I was busy then. I looked at that post today and thought "that's cute."

But I am back on the blog for these last few weeks of my archive work at Willamette.

Here's the update:

1. The Sackett Collection. I stayed up way too late last night and missed the meeting today where we were supposed to look at the digital platform for the photos. I feel so bad about that. Mostly because I really am so excited about this and so grateful for all the support I've received from you all with this project. It's been amazing. Anyway, I will go to bed at a decent hour tonight and will be ready to go tomorrow at 10 a.m.

2. I scanned all the history theses and they are saved to J:\THESES_History

3. I rescanned page 25 from the 1927 Wallulah. tifs and jpgs are saved in Drobo.

4. I will rescan the 2006 Wallulah tomorrow and hopefully get rid of that weird glare/streak.

Hopefully I remember to do this again tomorrow.