mercredi 22 mai 2019

Web Scrapping a password web R

I want to sign into a webpage,then, read some nodes using R... I've tried some solutions as

https://github.com/tidyverse/rvest/blob/master/demo/united.R

Scrape password-protected website in R

https://www.biostars.org/p/297028/

I tried with rvest this code

url <-"https://semanaeconomica.com/#" 
session <- html_session(url)

form <- html_form(read_html(url))[[1]]

filled_form <- set_values(form,
                      user_login = "user",
                      user_pass = "pass")

r<- submit_form(form, filled_form)
r2<-jump_to(r, "http://semanaeconomica.com/article/sectores-y-empresas/comercio/360699-estrategia-frente-al-informal/")

The message "Status: 200" appear...But in the next steps, the results are character0, this is, the login fail!!!

r3<-read_html(r2)
p_text <- r3%>%
 html_nodes("address")%>%
html_text()%>%print()

Can you help me? Thanks!




Aucun commentaire:

Enregistrer un commentaire