PDFParser.extractDublinCoreListItems

How to use extractDublinCoreListItems method in org.apache.tika.parser.pdf.PDFParser

Best Java code snippets using org.apache.tika.parser.pdf.PDFParser.extractDublinCoreListItems (Showing top 3 results out of 315)

origin: apache/tika

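// Title: multilingual values from XMP, with the document information title as baseline
extractMultilingualItems(metadata, TikaCoreProperties.TITLE, info.getTitle(), dcSchema);
addMetadata(metadata, PDF.DOC_INFO_TITLE, info.getTitle());
// Creator: Dublin Core list from XMP, with the document information author as baseline
extractDublinCoreListItems(metadata, TikaCoreProperties.CREATOR, info.getAuthor(), dcSchema);
addMetadata(metadata, PDF.DOC_INFO_CREATOR, info.getAuthor());
// Contributor: no baseline is passed (null), so only XMP values can be used
extractDublinCoreListItems(metadata, TikaCoreProperties.CONTRIBUTOR, null, dcSchema);
// Creator tool: the value of info.getCreator(), stored under both keys
addMetadata(metadata, TikaCoreProperties.CREATOR_TOOL, info.getCreator());
addMetadata(metadata, PDF.DOC_INFO_CREATOR_TOOL, info.getCreator());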

origin: com.github.lafa.tikaNoExternal/tika-external, org.apache.tika/tika-parsers

Same snippet as apache/tika above.
org.apache.tika.parser.pdf.PDFParser.extractDublinCoreListItems

Javadoc

This tries to read a list from a particular property in XMPSchemaDublinCore. If it can't find the information, it falls back to the pdfboxBaseline. The pdfboxBaseline should be the value that pdfbox returns from its PDDocumentInformation object (e.g. getAuthor()). This method is designed to include the pdfboxBaseline, and it should not duplicate the pdfboxBaseline.

Until PDFBOX-1803/TIKA-1233 are fixed, do not call this on dates!

This relies on the property having a DublinCore-compliant getName().
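
The behavior described in the Javadoc (use the XMP list, fall back to the baseline, include the baseline once without duplicating it) can be sketched roughly as follows. This is a minimal illustration using plain Java collections, not Tika's actual implementation: the class DublinCoreListSketch, the method mergeWithBaseline, and the ordering of the result are assumptions made for the example, and the XMP lookup is replaced by a list passed in by the caller.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for the merge described in the Javadoc; not Tika's code.
class DublinCoreListSketch {

    /**
     * @param xmpItems       values read from the XMP Dublin Core property,
     *                       or null if nothing was found there
     * @param pdfboxBaseline value from PDDocumentInformation (e.g. getAuthor()),
     *                       may be null or empty
     * @return the XMP items plus the baseline, with the baseline present at most once
     */
    static List<String> mergeWithBaseline(List<String> xmpItems, String pdfboxBaseline) {
        List<String> result = new ArrayList<>();
        boolean hasBaseline = pdfboxBaseline != null && !pdfboxBaseline.isEmpty();

        if (xmpItems != null) {
            for (String item : xmpItems) {
                // Skip null entries and any entry equal to the baseline,
                // so the baseline is not added twice.
                if (item != null && !(hasBaseline && item.equals(pdfboxBaseline))) {
                    result.add(item);
                }
            }
        }
        // Add the baseline exactly once; with no XMP items this is the fallback.
        if (hasBaseline) {
            result.add(pdfboxBaseline);
        }
        return result;
    }

    public static void main(String[] args) {
        // XMP lists two creators, one of which matches the baseline.
        System.out.println(mergeWithBaseline(Arrays.asList("Ada", "Grace"), "Ada")); // [Grace, Ada]
        // No XMP data: only the baseline is returned.
        System.out.println(mergeWithBaseline(null, "Ada")); // [Ada]
    }
}
```

Passing null as the baseline, as the CONTRIBUTOR call in the snippet does, leaves only the XMP values in the result.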

Popular methods of PDFParser

  • <init>
  • parse
  • addMetadata
  • decode
  • extractMetadata
  • extractMultilingualItems
    Try to extract all multilingual items from the XMPSchema This relies on the property having a valid
  • getPassword
  • getXMPBagOrSeqList
    As of this writing, XMPSchema can contain bags or sequence lists for some attributes...despite stand
  • handleXFAOnly
  • loadDOM
  • shouldHandleXFAOnly
  • getDocument
  • getPDDocument
  • setInitializableProblemHandler
  • setOcrStrategy
  • setSortByPosition
  • setTempDirectory
