
Text Processing with Ruby, in beta

April 29, 2015

Plain text is everywhere. Web pages, databases, code, the contents of files–for almost any programming task you perform, you need to process text. Level up your text processing skills with Text Processing with Ruby: Extract Value from the Data That Surrounds You, now available in beta from

Come and get it!

Text Processing with Ruby: Extract Value from the Data That Surrounds You

Most information in the world is in text format, and programmers often find themselves needing to make sense of the data hiding within. You want to do this efficiently, avoiding labor-intensive, manual work—and Ruby is ideally suited to this task.

Text Processing with Ruby takes a practical approach to working with text:

  • First, Acquire: Explore Ruby’s core and standard library, and what’s possible with IO and its derived classes like File. Extract text into your Ruby programs from the file system and standard input. Process delimited files such as CSVs, and write utilities that interact with other programs in text-processing pipelines. Process web pages with Nokogiri to pull out information from even the messiest of HTML, and decipher character encoding mysteries.
  • Second, Transform: Use regular expressions to match, extract, and replace patterns in text. Write a parser using Ruby’s StringScanner library. Use Natural Language Processing techniques to extract keywords and implement fuzzy searching.
  • Finally, Load: Write the transformed text and data to standard output, files, and other processes. Serialize text into JSON, XML, and CSV, and use ERB to create more complex formats.
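
To give a feel for the three stages, here is a minimal sketch that uses only Ruby's standard library (CSV, StringScanner, and JSON). It is not taken from the book: the sample data and the method names acquire, transform, and load_json are invented for illustration, and a real script would more likely read from a file or standard input than from an inline string.

```ruby
require "csv"
require "json"
require "strscan"

# Hypothetical sample data, standing in for a file or standard input.
RAW = <<~DATA
  name,note
  Ada,"ordered 3 books on 2015-04-29"
  Bob,"ordered 12 books on 2015-05-07"
DATA

# Acquire: parse delimited text into one hash per row, keyed by header.
def acquire(text)
  CSV.parse(text, headers: true).map(&:to_h)
end

# Transform: pull the count and the date out of the free-text note.
def transform(row)
  scanner = StringScanner.new(row["note"])
  scanner.skip_until(/ordered /)          # move past the leading word
  count = scanner.scan(/\d+/).to_i        # digits immediately after it
  date = row["note"][/\d{4}-\d{2}-\d{2}/] # first ISO-style date, if any
  { "name" => row["name"], "books" => count, "date" => date }
end

# Load: serialize the transformed records as JSON.
def load_json(records)
  JSON.generate(records)
end

records = acquire(RAW).map { |row| transform(row) }
puts load_json(records)
```

Keeping each stage in its own method means the input source or the output format can be swapped without touching the other two.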

You'll soon be able to tackle even the most enormous and entangled text with ease, scything through gigabytes of data and effortlessly extracting the bits that matter.

Now available in beta from

Upcoming Author Appearances

  • 2015-04-29 Johanna Rothman,
    KWSQA, Waterloo CA
  • 2015-04-29 Seb Rose,
    CoDeOSL, Oslo
  • 2015-04-30 Johanna Rothman,
    KWSQA, Waterloo CA
  • 2015-05-07 Chris Adamson,
    CocoaConf Portland
  • 2015-05-07 Rachel Davies,
    Agile Manchester, UK
  • 2015-05-08 Johanna Rothman,
    Testing & Quality Leadership Summit, Orlando, FL
  • 2015-05-08 Chris Adamson,
    CocoaConf Portland
  • 2015-05-09 Chris Adamson,
    CocoaConf Portland
  • 2015-05-14 Seb Rose,
    SDD, London
  • 2015-05-29 Jeffrey R. Kelley,
    Self.conference, Detroit, MI

Did You Know?

    We publish books across a range of technologies for folks at different levels of abilities, all the way from novice to expert. If you haven't browsed all of our titles recently, come take a look:

    Agile Practices • Android, iPhone, and Mobile Programming • Career
    Development • Cool Things for Smart People • DIY & Hardware •
    For Beginners • Gaming • Java and JVM Languages • Mac, iPhone,
    and iPad Programming • Pragmatic exPress • Ruby and Rails •
    Testing, Design, and Cloud Computing • Tools, Frameworks, Languages
    • Web 2.0+

    Come check out all our categories at

    Don't Get Left Out

    Are your friends jealous that you get these spiffy email newsletters and they don't? Clue them in that all they need to do is create an account on (email address and password is all it takes) and select the checkbox to receive newsletters.

    Are you following us on Twitter and/or Facebook? Here's where you can find us and keep up with the latest news and commentary, and occasional discounts:


    Follow us on Twitter: pragprog, pragpub, Andy (PragmaticAndy), and Dave (pragdave).

    Coming Soon:

    • Pragmatic Scala: Create Expressive, Concise, and Scalable Applications, in beta
    • Hello, Android, 4th Edition in print
    • Real-World Kanban: Do Less, Accomplish More with Lean Thinking, in beta

    Recently Released:

    Thanks for your continued support,

    Andy & Dave
    The Pragmatic Programmers

    Books • eBooks • PragPub Magazine • Audiobooks and Screencasts