Hi i am trying to do a mapside join in crunch using MapsideJoinStrategy class. It is working fine for inner join but it gives this error for full outer join :" Join type FULL_OUTER_JOIN not supported by MapsideJoinStrategy"
How to do a Map side full outer join in Apache Crunch ( Join type FULL_OUTER_JOIN not supported by MapsideJoinStrategy )
709 views Asked by user3500433 At
1
There are 1 answers
Related Questions in HADOOP
- pcap to Avro on Hadoop
- schedule and automate sqoop import/export tasks
- How to diagnose Kafka topics failing globally to be found
- Only 32 bit available in Oracle VM - Hadoop Installation
- Using HDFS with Apache Spark on Amazon EC2
- How to get raw hadoop metrics
- How to output multiple values with the same key in reducer?
- Loading chararray from embedded JSON using Pig
- Oozie Pig action stuck in PREP state and job is in RUNNING state
- InstanceProfile is required for creating cluster - create python function to install module
Related Questions in MAPREDUCE
- pcap to Avro on Hadoop
- CouchDB sum by date range and type
- How to output multiple values with the same key in reducer?
- mapreduce job not setting compression codec correctly
- Split S3 files into multiple output files
- groupByKey not properly working in spark
- MapReduce job fails with ExitCodeException exitCode=255
- What is better way to send associative array through map/reduce at MongoDB?
- How to efficiently join two files using Hadoop?
- null pointer exception in getstrings method hadoop
Related Questions in APACHE-CRUNCH
- Configuring number of reducers for a particular Dofn in Apache crunch
- org.apache.crunch.CrunchRuntimeException: java.io.NotSerializableException
- How to trace the origin of "<init>()V" failures in Avro?
- WordCount with Apache Crunch into HBase Standalone
- Hadoop Job: Error injecting constructor, JAXBException
- How to write output of Apache Crunch to Amazon S3 bucket
- Can Apache Crunch be used to create Graph like data structure?
- How to do a Map side full outer join in Apache Crunch ( Join type FULL_OUTER_JOIN not supported by MapsideJoinStrategy )
- How to run Apache Crunch application without a Hadoop?
- Could not find or load main class while trying to run project from IntelliJ
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Popular Tags
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
MapsideJoinStrategy can not perform RIGHT_OUTER_JOIN and so FULL_OUTER_JOIN. It is impossible by design. Whole work happens in mappers (no reduce phase). Since there can be more than one mapper it is not possible to determine which key from right-side will not have matching key on left-side, because single mapper will not see whole left-side data.
For FULL_OUTER_JOIN use DefaultJoinStrategy.
I've extended BloomFilterJoinStrategy to suport all join types. Here is pull request @ GitHub.