Skip to content

Instantly share code, notes, and snippets.

@jeroenjanssens
Created April 18, 2017 17:39
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save jeroenjanssens/1a41fd1110dfa27cb2da2d6ad034afd7 to your computer and use it in GitHub Desktop.
Save jeroenjanssens/1a41fd1110dfa27cb2da2d6ad034afd7 to your computer and use it in GitHub Desktop.
library(rvest)
html_more_nodes <- function(session, css, more_css) {
xml2:::xml_nodeset(c(
html_nodes(session, css),
tryCatch({
html_more_nodes(follow_link(session, css = more_css),
css, more_css)
}, error = function(e) NULL)
))
}
# Follow "More" link to get all stories on Hacker News
html_session("https://news.ycombinator.com") %>%
html_more_nodes(".storylink", ".morelink") %>%
html_text()
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment