Filter Search Results
Content the user has created or edited.
Clear Filter

Showing 1-10 of 87 for Tika

  • Tika Summary Purpose Detects and extracts metadata and text content from documents. Homepage http://tika.apache.org/ Source Code Repository https://github.com/apache/tika License Apache License, Version 2.0 Debian Package Description Java based tool for detecting and extracting metadata and text content from documents
  • EVAL ARC2WARCTOMAR with Tika Evaluator(s) Sven Schlarb <[email protected]> Evaluation points Assessment of measurable points Metric Description Metric baseline Metric goal March 04, 2014 (1000) March 04, 2014 (4924) March 04, 2014 (9856) NumberOfObjectsPerHour Number of objects processed in one hour 545,17246
  • SO11 The Tika characterisation Tool Title The Tika Characterisation Tool Detailed description Tika have been chosen as a useful tool for the PC CC workpackage … Apache TIKA 1.0 workflow http://www.myexperiment.org/workflows/2583.html Simple Tika RESTful web service workflow http://www.myexperiment.org/workflows/2660.html
  • Tika Batch File Identification Title Tika Batch File Identification Detailed description {}Overview: Group of issues surrounding batch processing of large … into 3 blocks: 1. Recursively run Apache Tika http://wiki.opflabs.org/display/TR/Tika over all files in a specified directory and save output metadata
  • Example Working with Apache Tika This is based on this example from SpringSource https://github.com/SpringSource/springsocial/wiki/Contributing, and explains how I collaborated with Apache Tika on issue TIKA849 https://issues.apache.org/jira/browse/TIKA849: The critical difference is that Apache Tika Git is generated
  • Extracting and aggregating metadata with Apache Tika Extracting and aggregating metadata with Tika At the Glasgow Mashup Peter May created a Python wrapper for Apache Tika. Carl Wilson extended this work, creating a Java utility class that wrapped Tika, providing simple configuration, two types of call to Tika (simple
  • Parsing PST OST file using TIKA Title Parsing PST OST file using TIKA Detailed description The Apache Tika™ toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries. This solution uses Tika toolkit. http://tika.apache.org/ Fields used from the email
  • Large scale document characterization and identification with Tika and DRIOID on SCAPE Azure platform Status Active Contact Ivan Vujic [email protected] … the speed of the Apache Tika Content Analysis Toolkit and the DROID File Format Identification Tool when they were run on a Microsoft Azure virtual machine
  • SO17 Web Archive MimeType detection workflow based on Droid and Apache Tika Title SO17 Web Archive MimeType detection workflow based on Droid and Apache Tika … distribution list in XLS format. The workflow exists in two versions: ) One using the TIFOWA tool (by ONB) utilizing the Apache TIKA 0.7 API. ) One using DROID
  • PC.WP1 Tool tracker Tika package by BL UNIX file package by ? FITS package by ? ffprobe package by SB DROID? Tika UNIX file FITS ffprobe DROID