Watch Me Code, Episode 8

Written by: Stephen Connolly

In Episode 7 of "Watch Me Code," we parsed some dates by scraping HTML. Not content with scraping just a date, in this episode we continue by scraping more details from various HTML pages in the GitWeb view.

This episode starts with an explanation of why we can safely assume the RFC 2822 date format used in Episode 7. After that, we implement the HTML scraping methods required to list the branches and tags and to resolve the latest revision of a tag or branch.
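As a rough illustration of the two ideas above, here is a minimal sketch and not the code from the video. It assumes the date text has already been extracted from the page and that the JDK's `DateTimeFormatter.RFC_1123_DATE_TIME` is an acceptable stand-in for the common RFC 2822 form (it does not handle the obsolete syntax or comments that RFC 2822 permits). The class name, method names, and the HTML fragment are all hypothetical; in particular, the assumption that branch links carry `h=refs/heads/<name>` in their `href` is for illustration only and should be checked against the GitWeb instance you are scraping.

```java
import java.time.Instant;
import java.time.OffsetDateTime;
import java.time.format.DateTimeFormatter;
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper class; none of these names come from the episode.
final class GitWebScrapeSketch {

    // ASSUMPTION: branch links look like href="...;h=refs/heads/<name>".
    private static final Pattern HEAD_LINK =
            Pattern.compile("h=refs/heads/([^\"&;]+)\"");

    // Parses a date such as "Tue, 3 Jun 2008 11:05:30 +0200" into an Instant.
    static Instant parseRfc2822(String text) {
        return OffsetDateTime
                .parse(text.trim(), DateTimeFormatter.RFC_1123_DATE_TIME)
                .toInstant();
    }

    // Collects distinct branch names in the order they appear in the HTML.
    static List<String> branchNames(String html) {
        List<String> names = new ArrayList<>();
        Matcher m = HEAD_LINK.matcher(html);
        while (m.find()) {
            String name = m.group(1);
            if (!names.contains(name)) {
                names.add(name);
            }
        }
        return names;
    }

    public static void main(String[] args) {
        // Made-up fragment standing in for a scraped "heads" page.
        String html =
                "<a class=\"list name\" href=\"/?p=repo.git;a=shortlog;h=refs/heads/master\">master</a>"
              + "<a class=\"list name\" href=\"/?p=repo.git;a=shortlog;h=refs/heads/develop\">develop</a>";
        System.out.println(branchNames(html));
        System.out.println(parseRfc2822("Tue, 3 Jun 2008 11:05:30 +0200"));
    }
}
```

A regular expression is used here only to keep the sketch dependency-free; a real HTML parser would be more robust against markup changes.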

If you are interested in examining my work so far, you can look at the GitHub code snapshot as of the end of this video.

Stephen Connolly is a member of the engineering team at CloudBees. He has over 20 years of experience in software development. He is involved in a number of open source projects, including Jenkins and Apache. Stephen was one of the first non-Sun committers to the Jenkins project and developed the weather icons. Stephen lives in Dublin, Ireland - where the weather icons are particularly useful. Follow Stephen on Twitter and on his blog.

