How to use the getTemplates method in de.tudarmstadt.ukp.wikipedia.parser.ParsedPage

Best Java code snippets using de.tudarmstadt.ukp.wikipedia.parser.ParsedPage.getTemplates (Showing top 7 results out of 315)

/**
 *
 * Returns the set of names of all templates that are contained in the given
 * article (without duplicates).<br>
 *
 * Note: The names are SQL escaped using {@link StringUtils#sqlEscape(String)}.
 *
 * @param pageText
 *            the page to get the templates from
 * @return a set of template names (without duplicates)
 */
private Set<String> getTemplateNames(String pageText)
{
  Set<String> names = new HashSet<String>();
  if (!pageText.isEmpty()) {
    try {
      ParsedPage pp = parser.parse(pageText);
      List<Template> templates = pp.getTemplates();
      for (Template t : templates) {
        names.add(StringUtils.sqlEscape(t.getName().toLowerCase()));
      }
    }
    catch (Exception e) {
      // Most likely parsing problems
      logger.error("Problems parsing page!", e);
    }
  }
  return names;
}

private void setTemplates(Article article, ParsedPage page) {
  final List<Template> templates = new ArrayList<Template>(10);
  for (final de.tudarmstadt.ukp.wikipedia.parser.Template t : page
      .getTemplates()) {
    final List<String> templateParameters = t.getParameters();
    parseTemplatesSchema(article, templateParameters);
    if (t.getName().toLowerCase().startsWith("infobox")) {
      article.setInfobox(new Template(t.getName(), templateParameters));
    } else {
      templates.add(new Template(t.getName(), templateParameters));
    }
  }
  article.setTemplates(templates);
}

tplLoop:for(Template tpl:pp.getTemplates()){
  if(tpl.getName().equalsIgnoreCase(templateName)){
    containsTpl=true;

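/**
 * Does the same as revisionContainsTemplateFragment() without using a template index.
 *
 * @param revId
 *            the id of the revision to check
 * @param templateFragment
 *            the prefix that a template name must start with (case-insensitive)
 * @return true if the revision contains a template whose name starts with the fragment
 * @throws WikiApiException
 */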
public boolean revisionContainsTemplateFragmentWithoutIndex(int revId, String templateFragment) throws WikiApiException{
  if(revApi==null){
    revApi = new RevisionApi(wiki.getDatabaseConfiguration());
  }
  if(parser==null){
    //TODO switch to SWEBLE
    MediaWikiParserFactory pf = new MediaWikiParserFactory(
        wiki.getDatabaseConfiguration().getLanguage());
    pf.setTemplateParserClass(ShowTemplateNamesAndParameters.class);
    parser = pf.createParser();
  }
  List<Template> tplList = parser.parse(revApi.getRevision(revId).getRevisionText()).getTemplates();
  for(Template tpl:tplList){
    if(tpl.getName().toLowerCase().startsWith(templateFragment.toLowerCase())){
      return true;
    }
  }
  return false;
}

for( Template t: pp.getTemplates()){
  nrOfTemplates++;
  String templateName = t.getName().toLowerCase();

if (pp.nrOfNestedLists() != 0) nrOfPagesWithNl++;
if (pp.nrOfTables() != 0) nrOfPagesWithTables++;
if (pp.getTemplates().size() != 0) nrOfPagesWithTemplates++;
if (pp.getSections().size() > 1) nrOfPagesWithSubSections++;

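/**
 * Does the same as revisionContainsTemplateName() without using a template index.
 *
 * @param revId
 *            the id of the revision to check
 * @param templateName
 *            the template name to look for (case-insensitive)
 * @return true if the revision contains a template with the given name
 * @throws WikiApiException
 */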
public boolean revisionContainsTemplateNameWithoutIndex(int revId, String templateName) throws WikiApiException{
  if(revApi==null){
    revApi = new RevisionApi(wiki.getDatabaseConfiguration());
  }
  if(parser==null){
    //TODO switch to SWEBLE
    MediaWikiParserFactory pf = new MediaWikiParserFactory(
        wiki.getDatabaseConfiguration().getLanguage());
    pf.setTemplateParserClass(ShowTemplateNamesAndParameters.class);
    parser = pf.createParser();
  }
  List<Template> tplList = parser.parse(revApi.getRevision(revId).getRevisionText()).getTemplates();
  for(Template tpl:tplList){
    if(tpl.getName().equalsIgnoreCase(templateName)){
      return true;
    }
  }
  return false;
}
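
The two "WithoutIndex" methods above differ only in how they compare template names: one requires a case-insensitive exact match, the other a case-insensitive prefix match. The following is a minimal sketch of that difference, not code from the library. It works on plain strings so that it compiles without the parser on the classpath; the class `TemplateNameMatch` and both method names are hypothetical. It also lowercases with `Locale.ROOT`, which is an assumption of this sketch (the snippets above call `toLowerCase()` with the default locale).

```java
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

// Hypothetical helper: template names are modeled as plain strings.
class TemplateNameMatch {

    /** True if any name equals the query, ignoring case (exact match). */
    static boolean containsName(List<String> templateNames, String query) {
        for (String name : templateNames) {
            if (name.equalsIgnoreCase(query)) {
                return true;
            }
        }
        return false;
    }

    /** True if any name starts with the fragment, ignoring case (prefix match). */
    static boolean containsFragment(List<String> templateNames, String fragment) {
        String prefix = fragment.toLowerCase(Locale.ROOT);
        for (String name : templateNames) {
            if (name.toLowerCase(Locale.ROOT).startsWith(prefix)) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        List<String> names = Arrays.asList("Infobox person", "Citation needed");
        // "infobox" is not a full template name, so the exact match fails.
        System.out.println(containsName(names, "infobox"));
        // "Infobox person" starts with "infobox", so the prefix match succeeds.
        System.out.println(containsFragment(names, "infobox"));
    }
}
```

A prefix match is the natural choice for template families such as infoboxes, where the name carries a suffix, while an exact match avoids accidental hits on longer names.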

Javadoc

Returns a list of templates that are used in the page.
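
Because the returned list can contain the same template several times, callers in the snippets above often reduce it to a set of lower-cased names. The sketch below illustrates that pattern under stated assumptions: `NamedTemplate` is a hypothetical stand-in for the parser's `Template` type (only `getName()` is modeled), and `DistinctTemplateNames`, `distinctNames` and `named` are illustrative names that do not exist in the library.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

class DistinctTemplateNames {

    // Hypothetical stand-in for the parser's Template type; only getName() is modeled.
    interface NamedTemplate {
        String getName();
    }

    /** Collects lower-cased template names without duplicates, in first-seen order. */
    static Set<String> distinctNames(List<? extends NamedTemplate> templates) {
        Set<String> names = new LinkedHashSet<String>();
        for (NamedTemplate t : templates) {
            names.add(t.getName().toLowerCase(Locale.ROOT));
        }
        return names;
    }

    /** Convenience factory for the stand-in type (illustrative only). */
    static NamedTemplate named(final String name) {
        return () -> name;
    }

    public static void main(String[] args) {
        List<NamedTemplate> templates = new ArrayList<NamedTemplate>();
        templates.add(named("Infobox person"));
        templates.add(named("infobox Person")); // same name, different case
        templates.add(named("Cite web"));
        System.out.println(distinctNames(templates)); // prints [infobox person, cite web]
    }
}
```

With the real parser, the list passed to such a helper would be the result of `getTemplates()` on a parsed page, and the loop body would call `getName()` on each template as the snippets above do.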

Popular methods of ParsedPage

getLinks
getSections
getText
getLanguages
getFirstParagraph
getLanguagesElement
getCategories
Returns a list of category Link objects. This is a shortcut for writing getCategoryElement.getLink
getFirstParagraphNr
Returns the number of the first paragraph.
getNestedLists
Returns a list of nested lists.
getParagraph
Returns the paragraph indicated by the parameter i.
getSection
getTables
Returns a list of tables.
