c# - HtmlAgilityPack DocumentNode.SelectNodes returns null, shouldn't

Question

Welcome To Ask or Share your Answers For Others

c# - HtmlAgilityPack DocumentNode.SelectNodes returns null, shouldn't

posted Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points)

c# - HtmlAgilityPack DocumentNode.SelectNodes returns null, shouldn't

I'm trying to scrape content from an example page using the HTML agility pack. The DocumentNode.SelectNodes is returning null for an XPath query when I think it shouldn't. Could someone tell me why? The code is:

HtmlDocument doc = new HtmlDocument();
string xpath = "//h1[@class='product-title fn']"; // note, it still returns 
                                                  // null even with "//div"
doc.OptionFixNestedTags = true;
HtmlNode.ElementsFlags.Remove("form");
HtmlNode.ElementsFlags.Remove("option");

HtmlNodeCollection coll = doc.DocumentNode.SelectNodes(xpath);

if (coll != null)
{
    // do stuff
}
else
{
    // not expecting it to be null unless no matches
}

See Question&Answers more detail:os

与恶龙缠斗过久,自身亦成为恶龙；凝视深渊过久,深渊将回以凝视…

1 Reply

深蓝 · Answer 1 · 2021-10-23T21:33:50+0000

According to the upstream bug comments it is for consistency:

DarthObiwan wrote Jan 11 2011 at 9:27 PM

This has been covered before, this function is written to mimic the way the System.XML works. Doing so will cause a major breaking change and thus will probably be slated for 2.0

Categories

c# - HtmlAgilityPack DocumentNode.SelectNodes returns null, shouldn't

c# - HtmlAgilityPack DocumentNode.SelectNodes returns null, shouldn't

Please log in or register to add a comment.

Please log in or register to reply this article.

1 Reply

Please log in or register to add a comment.

Just Browsing Browsing

Most popular tags