Why does a Fetch task in Hive work faster than a Map-only task?

A FetchTask reads the data directly from storage, whereas a map-only task has to submit and run a MapReduce job, with the job-startup overhead that entails.

<property>
  <name>hive.fetch.task.conversion</name>
  <value>minimal</value>
  <description>
    Some select queries can be converted to a single FETCH task,
    minimizing latency. Currently the query should be single
    sourced, not having any subquery, and should not have
    any aggregations or distincts (which incur a reduce sink, RS),
    lateral views or joins.
    1. minimal : SELECT STAR, FILTER on partition columns, LIMIT only
    2. more    : SELECT, FILTER, LIMIT only (+TABLESAMPLE, virtual columns)
  </description>
</property>
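
As a rough illustration of the rules in that description, here is a minimal Python sketch. It is only a simplified reading of the description text, not Hive's actual optimizer: `QueryShape` and `may_convert_to_fetch` are hypothetical names, only the two listed values (`minimal`, `more`) are modelled, and TABLESAMPLE and virtual columns are ignored.

```python
from dataclasses import dataclass


@dataclass
class QueryShape:
    """Hypothetical summary of a SELECT query (not a Hive class)."""
    single_source: bool = True
    has_subquery: bool = False
    has_aggregation_or_distinct: bool = False
    has_lateral_view: bool = False
    has_join: bool = False
    select_star_only: bool = True            # SELECT * with no other expressions
    filters_only_on_partitions: bool = True  # also True when there is no WHERE


def may_convert_to_fetch(mode: str, q: QueryShape) -> bool:
    """Simplified reading of the description text, not Hive's real logic."""
    if mode not in ("minimal", "more"):
        raise ValueError("only the two listed values are modelled here")
    # Conditions that apply to both modes.
    basic = (
        q.single_source
        and not q.has_subquery
        and not q.has_aggregation_or_distinct
        and not q.has_lateral_view
        and not q.has_join
    )
    if not basic:
        return False
    if mode == "minimal":
        # SELECT STAR, FILTER on partition columns, LIMIT only.
        return q.select_star_only and q.filters_only_on_partitions
    # "more": any SELECT list and any FILTER are acceptable.
    return True
```

Under this model, a query that selects named columns and filters on a regular column qualifies under `more` but not under `minimal`, while any query with a join is rejected in both modes.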

There is also another parameter, hive.fetch.task.conversion.threshold, which defaults to -1 through 0.13 and to 1 GB (1073741824 bytes) from 0.14 onward. With the 1 GB default, if the input is larger than 1 GB, Hive uses MapReduce instead of a Fetch task.
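
The threshold check can be sketched in a few lines of Python. This is an illustrative model only: `within_fetch_threshold` is a hypothetical helper, and it assumes that a negative threshold such as -1 means no size limit is applied.

```python
ONE_GB = 1073741824  # default threshold in bytes from 0.14 onward


def within_fetch_threshold(input_bytes: int, threshold: int = ONE_GB) -> bool:
    """Return True if the input is small enough for a Fetch task.

    Assumption: a negative threshold (e.g. -1) disables the size check.
    """
    if threshold < 0:
        return True
    # Input larger than the threshold falls back to MapReduce.
    return input_bytes <= threshold
```

For example, `within_fetch_threshold(2 * ONE_GB)` is false under the 1 GB default, while `within_fetch_threshold(2 * ONE_GB, threshold=-1)` is true because no limit applies.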
