After doing some initial testing, manually entering items into the catalog, using the circulation and OPAC, and getting somewhat familiar with Koha, I decided it was time to import lots and lots of records. To do this, I used a copy of the book database at http://iblist.com/ . It is available under an open license for non-commercial use.
So, I had about 20,000 books to import, and this is what I discovered during the process:
The recommended way to import data into Koha is by converting it to MARC binary records, then importing with the supplied bulkmarcimport.pl script. I tried this, but later decided to use MARC21slim XML instead of the older binary MARC, due to character-encoding problems. Modifying the import script was trivial, and doing so saved me a lot of trouble.
My original translation path was as follows:
After discovering some encoding problems, I modified the process a bit. Instead of converting to binary MARC, I imported the MARC XML directly. This was faster, easier, and less error-prone.
During the process, the bulk of my time was spent writing a script to query the database and write the results in MODS format. This involved learning far more about MARC than I had expected, and modifying the MODS schema and stylesheet to accomodate MARC fields the LoC left out. I discovered quite a bit about Koha's MARC mappings during the process too, and had to modify some of Koha's templates slightly to display certain fields. I ran into a couple limitations in Koha 2.0.0 too, such as its lack of support for most repeating MARC fields.
The Koha users mailing list was very helpful throughout this process, especially in respect to finding the correct MARC fields to put data into.
After building translation tools, I found myself with a Koha database containing over 20,000 books. I am not yet finished with the process, but things are going reasonably well. I still need to figure out how to create in-stock items during the process, so I can later test circulation.
I intend to release the tools and scripts I used for this process, if anyone is interested. For now, contact me (ToyKeeper) if you would like copies, or more details.