org.ccil.cowan.tagsoup.Parser类的使用及代码示例

x33g5p2x  于2022-01-26 转载在 其他  
字(7.9k)|赞(0)|评价(0)|浏览(151)

本文整理了Java中org.ccil.cowan.tagsoup.Parser类的一些代码示例,展示了Parser类的具体用法。这些代码示例主要来源于Github/Stackoverflow/Maven等平台,是从一些精选项目中提取出来的代码,具有较强的参考意义,能在一定程度帮忙到你。Parser类的具体详情如下:
包路径:org.ccil.cowan.tagsoup.Parser
类名称:Parser

Parser介绍

[英]The SAX parser class.
[中]SAX解析器类。

代码示例

代码示例来源:origin: rest-assured/rest-assured

  1. slurper = new XmlSlurper(config.isValidating(), config.isNamespaceAware(), config.isAllowDocTypeDeclaration());
  2. } else {
  3. XMLReader p = new org.ccil.cowan.tagsoup.Parser();
  4. slurper = new XmlSlurper(p);

代码示例来源:origin: seven332/EhViewer

  1. /**
  2. * Returns displayable styled text from the provided HTML string.
  3. * Any <img> tags in the HTML will use the specified ImageGetter
  4. * to request a representation of the image (use null if you don't
  5. * want this) and the specified TagHandler to handle unknown tags
  6. * (specify null if you don't want this).
  7. *
  8. * <p>This uses TagSoup to handle real HTML, including all of the brokenness found in the wild.
  9. */
  10. public static SpannableStringBuilder fromHtml(String source, ImageGetter imageGetter,
  11. TagHandler tagHandler) {
  12. Parser parser = new Parser();
  13. try {
  14. parser.setProperty(Parser.schemaProperty, HtmlParser.schema);
  15. } catch (org.xml.sax.SAXNotRecognizedException e) {
  16. // Should not happen.
  17. throw new RuntimeException(e);
  18. } catch (org.xml.sax.SAXNotSupportedException e) {
  19. // Should not happen.
  20. throw new RuntimeException(e);
  21. }
  22. HtmlToSpannedConverter converter =
  23. new HtmlToSpannedConverter(source, imageGetter, tagHandler,
  24. parser);
  25. return converter.convert();
  26. }

代码示例来源:origin: apache/tika

  1. org.ccil.cowan.tagsoup.Parser parser = new org.ccil.cowan.tagsoup.Parser();
  2. parser.setProperty(org.ccil.cowan.tagsoup.Parser.schemaProperty, schema);
  3. parser.setContentHandler(handler);
  4. parser.parse(new InputSource(new StringReader(codeAsHtml)));

代码示例来源:origin: fourlastor/dante

  1. @Override public void parse(String string) {
  2. org.ccil.cowan.tagsoup.Parser parser = new org.ccil.cowan.tagsoup.Parser();
  3. parser.setContentHandler(this);
  4. try {
  5. parser.parse(new InputSource(new StringReader(string)));
  6. } catch (IOException | SAXException e) {
  7. throw new HtmlParsingException(e);
  8. }
  9. emptyBuffer();
  10. }

代码示例来源:origin: com.github.livesense/org.liveSense.service.xssRemove

  1. /**
  2. * Creates a DeXSSParser with the following feature set:
  3. * <ul>
  4. * <li>{@link DeXSSFilterPipeline#BODY_ONLY} <code>true</code></li>
  5. * </ul>
  6. * And uses as parent a {@link org.ccil.cowan.tagsoup.Parser} with the following feature set:
  7. * <ul>
  8. * <li>{@link org.ccil.cowan.tagsoup.Parser#ignoreBogonsFeature} <code>true</code></li>
  9. * <li>{@link org.ccil.cowan.tagsoup.Parser#defaultAttributesFeature} <code>false</code></li>
  10. * </ul>
  11. * TODO: Should be made more configurable.
  12. */
  13. public DeXSSParser() throws SAXNotRecognizedException, SAXNotSupportedException {
  14. super();
  15. setFeature(DeXSSFilterPipeline.BODY_ONLY, true);
  16. Parser parser = new Parser();
  17. parser.setFeature(Parser.ignoreBogonsFeature, true);
  18. parser.setFeature(Parser.defaultAttributesFeature, false);
  19. setParent(parser);
  20. }
  21. }

代码示例来源:origin: com.xmlcalabash/xmlcalabash

  1. private XdmNode tagSoup(String text) {
  2. StringReader inputStream = new StringReader(text);
  3. InputSource source = new InputSource(inputStream);
  4. Parser parser = new Parser();
  5. parser.setEntityResolver(runtime.getResolver());
  6. SAXSource saxSource = new SAXSource(parser, source);
  7. DocumentBuilder builder = runtime.getProcessor().newDocumentBuilder();
  8. try {
  9. XdmNode doc = builder.build(saxSource);
  10. return doc;
  11. } catch (Exception e) {
  12. throw new XProcException(e);
  13. }
  14. }

代码示例来源:origin: org.ccil.cowan.tagsoup/tagsoup

  1. public void parse (String systemid) throws IOException, SAXException {
  2. parse(new InputSource(systemid));
  3. }

代码示例来源:origin: org.ccil.cowan.tagsoup/tagsoup

  1. public void setProperty(String name, Object value)
  2. throws SAXNotRecognizedException, SAXNotSupportedException
  3. {
  4. parser.setProperty(name, value);
  5. }

代码示例来源:origin: org.ccil.cowan.tagsoup/tagsoup

  1. public void setFeature(String name, boolean value)
  2. throws SAXNotRecognizedException, SAXNotSupportedException
  3. {
  4. parser.setFeature(name, value);
  5. }

代码示例来源:origin: apache/tika

  1. new org.ccil.cowan.tagsoup.Parser();
  2. parser.setProperty(
  3. org.ccil.cowan.tagsoup.Parser.schemaProperty, schema);
  4. parser.setFeature(
  5. org.ccil.cowan.tagsoup.Parser.ignoreBogonsFeature, true);
  6. parser.setContentHandler(new XHTMLDowngradeHandler(
  7. new HtmlHandler(mapper, handler, metadata, context, extractScripts)));
  8. parser.parse(reader.asInputSource());

代码示例来源:origin: gamesbyangelina/spritely

  1. + query);
  2. Parser p = new Parser();
  3. "http://www.colourlovers.com/ajax/search-palettes/_page_1?sortCol=votes&sortBy=desc&query="
  4. + query);
  5. p.setContentHandler(handler);
  6. p.parse(new InputSource(u.openStream()));
  7. p.setContentHandler(pandler);
  8. p.parse(new InputSource(new URL(s).openStream()));

代码示例来源:origin: com.github.livesense/org.liveSense.service.xssRemove

  1. super();
  2. Parser parser = new Parser();
  3. parser.setFeature(Parser.defaultAttributesFeature, false);
  4. parser.setFeature(Parser.useAttributes2Feature, true);

代码示例来源:origin: org.daisy.libs/com.xmlcalabash

  1. private XdmNode tagSoup(String text) {
  2. StringReader inputStream = new StringReader(text);
  3. InputSource source = new InputSource(inputStream);
  4. Parser parser = new Parser();
  5. parser.setEntityResolver(runtime.getResolver());
  6. SAXSource saxSource = new SAXSource(parser, source);
  7. DocumentBuilder builder = runtime.getProcessor().newDocumentBuilder();
  8. try {
  9. XdmNode doc = builder.build(saxSource);
  10. return doc;
  11. } catch (Exception e) {
  12. throw new XProcException(e);
  13. }
  14. }

代码示例来源:origin: rest-assured/rest-assured

  1. slurper = new XmlSlurper(config.isValidating(), config.isNamespaceAware(), config.isAllowDocTypeDeclaration());
  2. } else {
  3. XMLReader p = new org.ccil.cowan.tagsoup.Parser();
  4. slurper = new XmlSlurper(p);

代码示例来源:origin: net.sf.ofx4j/ofx4j

  1. public void parseV1FromFirstElement(Reader reader) throws IOException, OFXParseException {
  2. Parser parser = new Parser();
  3. try {
  4. parser.setFeature(Parser.restartElementsFeature, false);
  5. }
  6. catch (Exception e) {
  7. throw new OFXParseException(e);
  8. }
  9. parser.setContentHandler(new TagSoupHandler(getContentHandler()));
  10. try {
  11. parser.parse(new InputSource(reader));
  12. }
  13. catch (SAXException e) {
  14. if (e.getCause() instanceof OFXParseException) {
  15. throw (OFXParseException) e.getCause();
  16. }
  17. throw new OFXParseException("Error parsing OFX document.", e);
  18. }
  19. }

代码示例来源:origin: gamesbyangelina/spritely

  1. URL u = new URL("http://commons.wikimedia.org/wiki/" + query);
  2. Parser p = new Parser();
  3. p.setContentHandler(scraper);
  4. HttpResponse response = client.execute(getRequest);
  5. p.parse(new InputSource(response.getEntity().getContent()));

代码示例来源:origin: trezor/trezor-android

  1. /**
  2. * Returns displayable styled text from the provided HTML string.
  3. * Any &lt;img&gt; tags in the HTML will use the specified ImageGetter
  4. * to request a representation of the image (use null if you don't
  5. * want this) and the specified TagHandler to handle unknown tags
  6. * (specify null if you don't want this).
  7. *
  8. * <p>This uses TagSoup to handle real HTML, including all of the brokenness found in the wild.
  9. */
  10. public static Spanned fromHtml(String source, ImageGetter imageGetter,
  11. TagHandler tagHandler) {
  12. Parser parser = new Parser();
  13. try {
  14. parser.setProperty(Parser.schemaProperty, HtmlParser.schema);
  15. } catch (org.xml.sax.SAXNotRecognizedException e) {
  16. // Should not happen.
  17. throw new RuntimeException(e);
  18. } catch (org.xml.sax.SAXNotSupportedException e) {
  19. // Should not happen.
  20. throw new RuntimeException(e);
  21. }
  22. HtmlToSpannedConverter converter =
  23. new HtmlToSpannedConverter(source, imageGetter, tagHandler,
  24. parser);
  25. return converter.convert();
  26. }

代码示例来源:origin: org.ccil.cowan.tagsoup/tagsoup

  1. protected SAXParserImpl() // used by factory, for prototypes
  2. {
  3. super();
  4. parser = new org.ccil.cowan.tagsoup.Parser();
  5. }

代码示例来源:origin: stoicflame/ofx4j

  1. public void parseV1FromFirstElement(Reader reader) throws IOException, OFXParseException {
  2. Parser parser = new Parser();
  3. try {
  4. parser.setFeature(Parser.restartElementsFeature, false);
  5. }
  6. catch (Exception e) {
  7. throw new OFXParseException(e);
  8. }
  9. parser.setContentHandler(new TagSoupHandler(getContentHandler()));
  10. try {
  11. parser.parse(new InputSource(reader));
  12. }
  13. catch (SAXException e) {
  14. if (e.getCause() instanceof OFXParseException) {
  15. throw (OFXParseException) e.getCause();
  16. }
  17. throw new OFXParseException("Error parsing OFX document.", e);
  18. }
  19. }

代码示例来源:origin: zulip/zulip-android

  1. final Context context = app.getApplicationContext();
  2. final float density = context.getResources().getDisplayMetrics().density;
  3. Parser parser = new Parser();
  4. try {
  5. parser.setProperty(Parser.schemaProperty, schema);
  6. } catch (SAXNotRecognizedException | SAXNotSupportedException e) {

相关文章