Getting Meta content by name using HTMLAgility
ItemTitle = TrimString(document.DocumentNode.SelectSingleNode("//meta[@name='title']").Attributes["content"].Value);
ItemTitle = TrimString(document.DocumentNode.SelectSingleNode("//meta[@name='title']").Attributes["content"].Value);
string body = ""; private void thebrowser_DocumentCompleted(object sender, WebBrowserDocumentCompletedEventArgs e) { if (thebrowser.ReadyState != WebBrowserReadyState.Complete) return; if (e.Url.AbsolutePath != (sender as WebBrowser).Url.AbsolutePath) return; if (body == thebrowser.Document.Body.InnerHtml) return; body = thebrowser.Document.Body.InnerHtml; }
It is matter of ServicePoint. Which provides connection management for HTTP connections. The default maximum number of concurrent connections allowed by a ServicePoint object is 2. So if you need to increase it you can use ServicePointManager.DefaultConnectionLimit property. Just check the link in MSDN there you can see a sample. And set the value you need.
System.Net.ServicePointManager.DefaultConnectionLimit = 1000; //or some other number > 4
Scraping time |
http://www.mediafire.com/download/ndb4dygkz1mjca5/AsinGraber_EEv2.1.zip
https://www.mediafire.com/file/y0h2hlb7ooxaswy/AGE.E+title+v3.0.2.zip/file
+ fixing gak bisa scraping
v3.0.3
https://www.mediafire.com/file/yfznjzwp4iksong/AGE.E+title+v3.0.3.7z/file
or
or
password = tulisanlain
+ fixing gak bisa scraping