Tabnine Logo
PDFParser.extractMultilingualItems
Code IndexAdd Tabnine to your IDE (free)

How to use
extractMultilingualItems
method
in
org.apache.tika.parser.pdf.PDFParser

Best Java code snippets using org.apache.tika.parser.pdf.PDFParser.extractMultilingualItems (Showing top 3 results out of 315)

origin: apache/tika

extractMultilingualItems(metadata, TikaCoreProperties.TITLE, info.getTitle(), dcSchema);
addMetadata(metadata, PDF.DOC_INFO_TITLE, info.getTitle());
extractDublinCoreListItems(metadata, TikaCoreProperties.CREATOR, info.getAuthor(), dcSchema);
addMetadata(metadata, PDF.DOC_INFO_KEY_WORDS, info.getKeywords());
addMetadata(metadata, PDF.DOC_INFO_PRODUCER, info.getProducer());
extractMultilingualItems(metadata, TikaCoreProperties.DESCRIPTION, null, dcSchema);
origin: com.github.lafa.tikaNoExternal/tika-external

extractMultilingualItems(metadata, TikaCoreProperties.TITLE, info.getTitle(), dcSchema);
addMetadata(metadata, PDF.DOC_INFO_TITLE, info.getTitle());
extractDublinCoreListItems(metadata, TikaCoreProperties.CREATOR, info.getAuthor(), dcSchema);
addMetadata(metadata, "producer", info.getProducer());
addMetadata(metadata, PDF.DOC_INFO_PRODUCER, info.getProducer());
extractMultilingualItems(metadata, TikaCoreProperties.DESCRIPTION, null, dcSchema);
origin: org.apache.tika/tika-parsers

extractMultilingualItems(metadata, TikaCoreProperties.TITLE, info.getTitle(), dcSchema);
addMetadata(metadata, PDF.DOC_INFO_TITLE, info.getTitle());
extractDublinCoreListItems(metadata, TikaCoreProperties.CREATOR, info.getAuthor(), dcSchema);
addMetadata(metadata, "producer", info.getProducer());
addMetadata(metadata, PDF.DOC_INFO_PRODUCER, info.getProducer());
extractMultilingualItems(metadata, TikaCoreProperties.DESCRIPTION, null, dcSchema);
org.apache.tika.parser.pdfPDFParserextractMultilingualItems

Javadoc

Try to extract all multilingual items from the XMPSchema

This relies on the property having a valid xmp getName()

For now, this only extracts the first language if the property does not allow multiple values (see TIKA-1295)

Popular methods of PDFParser

  • <init>
  • parse
  • addMetadata
  • decode
  • extractDublinCoreListItems
    This tries to read a list from a particular property in XMPSchemaDublinCore. If it can't find the in
  • extractMetadata
  • getPassword
  • getXMPBagOrSeqList
    As of this writing, XMPSchema can contain bags or sequence lists for some attributes...despite stand
  • handleXFAOnly
  • loadDOM
  • shouldHandleXFAOnly
  • getDocument
  • shouldHandleXFAOnly,
  • getDocument,
  • getPDDocument,
  • setInitializableProblemHandler,
  • setOcrStrategy,
  • setSortByPosition,
  • setTempDirectory

Popular in Java

  • Making http requests using okhttp
  • compareTo (BigDecimal)
  • setRequestProperty (URLConnection)
  • addToBackStack (FragmentTransaction)
  • Container (java.awt)
    A generic Abstract Window Toolkit(AWT) container object is a component that can contain other AWT co
  • RandomAccessFile (java.io)
    Allows reading from and writing to a file in a random-access manner. This is different from the uni-
  • URL (java.net)
    A Uniform Resource Locator that identifies the location of an Internet resource as specified by RFC
  • Arrays (java.util)
    This class contains various methods for manipulating arrays (such as sorting and searching). This cl
  • CountDownLatch (java.util.concurrent)
    A synchronization aid that allows one or more threads to wait until a set of operations being perfor
  • Table (org.hibernate.mapping)
    A relational table
  • Best IntelliJ plugins
Tabnine Logo
  • Products

    Search for Java codeSearch for JavaScript code
  • IDE Plugins

    IntelliJ IDEAWebStormVisual StudioAndroid StudioEclipseVisual Studio CodePyCharmSublime TextPhpStormVimGoLandRubyMineEmacsJupyter NotebookJupyter LabRiderDataGripAppCode
  • Company

    About UsContact UsCareers
  • Resources

    FAQBlogTabnine AcademyTerms of usePrivacy policyJava Code IndexJavascript Code Index
Get Tabnine for your IDE now