Technology & Tools
The tools and technology used by Newbook Digital Texts make up a "clear path to output" process for texts prepared in a modified version of the Text Encoding Initiative (TEI) document format. The goal of this process is to produce well-formed, valid, structured data from literary, historical, pedagogical, and other sources in multiple languages and scripts not readily available in print.
NDT uses a set of freely available open-source tools, tools developed by NDT, or commercial software freely available for non-commercial projects, e.g.,Cladonia Software Exchanger XML editor, which can be used in a three phase process:
- Producing generic auto-tagged texts to create TEI conforming XML output
- Customizing the TEI encoded text for individual projects
- Processing TEI tagged texts to create standardized output for Web and Print
Tools and Samples: The Newbook Process
- Generic auto-tagging tools to simplify the structural tagging procedure for creating Text Encoding Initiative [TEI] conforming XML input (e.g. by student interns).
- The resulting generic TEI texts can then be hand coded, using specific, additional TEI tags, to meet the requirements unique to individual projects.
- Using XSLT processors to produce TEI tagged texts for standardized output for Web and Print: XHTML/HTML5, PDF, and e-pub format
Below is a link to a sample document that can be used with the tools above to produce output. It may also be used as a model for marking-up your own data for use with our tools.
Further details are available on GitHub.
The software tools listed below are readily available from sources on the Internet. Scripts and document samples developed by NDT can be downloaded from this site.
- UTF-8 Editors: Notepad++ (WinX), TextWrangler (OSX), vi, Emacs
- Cladonia Software Exchanger XML (Java-based, multi-platform): CHECKs XML for well-formedness, validity
- xmllint (Unix/Linux): DETECT errors in XML output
- JMS Auto-tagger (PERL-based): CONVERT plain text transcripts to TEI-XML
- XSLT scripts: CONVERT valid TEI-XML to XHTML/HTML5, LaTeX, tag-set lists
- TeX Live/MikTeX: CONVERT LaTeX sources to PDF
- validator.w3.org: markup validation service