Thursday, July 30, 2015

July 30

I'm still working on the 1936 Wallulah. I found one pair of duplicates after I had already sent the files through the Scanning Station so there is no "page 8" or "page9" in the final processed file names. I also deleted the duplicates from the TIF Folder and JPG Folder in drobo.

Wednesday, July 29, 2015

1935 Finished

Today I finished the 1935 Wallulah. The file numbers of the final processed files are a bit off since I found two sets of duplicates after I had already run the files through the Scanning Station. There are no files named page30, page31, page44, or page45 which makes the total number of pages 168 rather than 172.

I also began working on the 1936 Wallulah. I used the .jpegs since the software had the same amount of difficulty recognizing text on the re-sized .tifs.

Monday, July 27, 2015

Monday July 27

Today I worked on processing the 1935 Wallulah using the re-sized .tifs. The frequency with which it recognizes characters seems to be comparable to the jpgs. I found duplicates of pages 26 and 27 (numbered 031 and 032 in drobo) which I deleted from drobo. However the files had been run through the Indexing Station before I realized the mistake. The drobo numbering begain with 002 so the ABBYY processed file numbering will now be missing pages 30 and 31.

Thursday, July 23, 2015

1934 Complete

Today I finished the 1934 Wallulah. I sent the 1935 Wallulah re-sized .tifs through the Scanning Station but can easily redo this if we'd rather continue using the .jpgs. When sending the files to a local folder it asks what format you'd like them in and I kept the preset selection of .jpg which we'd been using before. This can also be altered if I resend the images through the Scanning Station as I'm not sure if this was the right option.

I'll keep my normal schedule and won't come in tomorrow, however I will be in next Friday to make-up the for missing work on Tuesday.

Wednesday, July 22, 2015

July 22

Today I continued working on the 1934 Wallulah and it is over halfway complete. I'm working on the index pages right now which is more time consuming since the software does not recognizing much of the text. I also have to redo the original 36 pages that were lost in the midst of the ABBYY update. I'm hoping to finish 1934 in its entirety tomorrow.

Thursday, July 16, 2015

1933 Complete

Today I finished the 1933 Wallulah and updated the spreadsheet. I began processing 1934 during the remainder of my workday.

Tuesday, July 14, 2015

July 14

Today I finally finished the 1932 Wallulah. We had one glitch with the software early in the morning on a text-heavy page but it worked fine the rest of the day. Along with making progress on the 1933 Wallulah, I printed materials for Mary and scanned and printed a booklet for Alice.

Friday, July 10, 2015

July 10

Today I made up the remaining hours from the days I've missed work and continued work on the 1932 Wallulah.The ABBYY software was doing a better job of recognizing text after I got through the student section and onto student activities where the text was larger. It still seems to have trouble with rosters where the names are in small type and close together but otherwise I think it's picking up text better overall. I'm over halfway done and expect to finish 1932 on Monday.

Thursday, July 9, 2015

Major Redo

In the middle of the day in the midst of working on the 1933 Wallulah, Sara caught that I had loaded the .tif files into ABBYY for the 1932 Wallulah rather than the .jpg files. I had done the same thing with the 1933 Wallulah so I deleted all the files I'd processed and started the 1932 Wallulah all over again. It seems to be recognizing less type on the student pages with small print than it did with the .tifs so I am having to type more than I did the first time.

Wednesday, July 8, 2015

Duplicate Dilemma

Today I continued working on the 1932 Wallulah and essentially completed it except for a file name discrepancy. There are two files of the same image in both the .jpg and .tif drobo 1932 folders named "page189" and "page189b." I only processed the first of these images since they are of the exact same page in the actual book. However, I was confused as to why there are two files and which one was preferable to use for the final processing through ABBYY.

The duplicate images also messed up the numbering for the processed files as I had originally put both of them through the Scanning Station. I wasn't sure if I should rename the subsequent files since there is no "page 191" among the processed files.

I assumed we would address this issue tomorrow and preceded onto the 1933 Wallulah for the remainder of my work day.

Tuesday, July 7, 2015

Cleaning Day

In the morning we cleaned the Digital Production Lab. In addition to cleaning the keyboards, I dusted the desks and behind the computers. The rest of the day I worked on processing the 1932 Wallulah in ABBYY. The software seemed to be cooperating today as it didn't crash and was able to read most of the text. I'm not sure if this is due to the readability of the text in the yearbook itself or not. Regardless the process went much quicker and I'm halfway finished. In the afternoon I also completed a small Archives scanning project for Chris.

Monday, July 6, 2015

July 6

Today I was finally able to finish processing the 1931 Wallulah through the ABBYY software. Verification Station crashed four times while trying to process the same text-heavy page. After a couple hours of not using the program, it was finally able to process the page which had been causing so much trouble. During this interim I reviewed the 1966 Wallulah and cropped all skewed pages. However several pages had cut-off text so I am in the middle of editing the re-scanned images and replacing the originals.

Thursday, July 2, 2015

Back to Work

Today was my first day after a two-week vacation. I re-scanned a couple pages in the 1958 and 1960 Wallulahs which had been cut out of the periodical edition. The files are named "page#a" and "page#b" with the number of the page that proceeded the cut-out page so they show up in the correct order. I can re-name all the files in the folder if necessary though.

Then I resumed processing the 1931 Wallulah through ABBYY. Unfortunately, it is not recognizing any of the text on the index pages so I am having to re-type them completely which is rather time consuming. Twice today the software has shut down and stopped working. Thankfully, both glitches were hours apart so hopefully it doesn't start shutting down more frequently like before the update.