Find similar rows in a Spark DataFrame, based on Euclidean distance and business rules.

Write code that does the following:

- For each row, find the 1 to 5 other rows that are most similar.

- Similar rows must have identical values for some features and fall within a certain interval for other features.
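
The requirements above could be approached with a filtered self-join followed by a per-row ranking. The following is only a minimal sketch under stated assumptions: the posting does not name any columns, tolerances, or an id column, so `findSimilar`, `idCol`, `exactCols`, `intervalCols`, `distanceCols`, and the sample data are all hypothetical. It also assumes "in a certain interval" means an absolute difference within a per-feature tolerance, and that `distanceCols` is non-empty.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}
import org.apache.spark.sql.expressions.Window
import org.apache.spark.sql.functions._

object SimilarRows {
  // All parameter names are illustrative assumptions, not part of the job description.
  //   exactCols    : features that must be identical between two rows
  //   intervalCols : feature -> maximum allowed absolute difference
  //   distanceCols : numeric features used for the Euclidean distance (must be non-empty)
  def findSimilar(
      df: DataFrame,
      idCol: String,
      exactCols: Seq[String],
      intervalCols: Map[String, Double],
      distanceCols: Seq[String],
      k: Int = 5): DataFrame = {
    val a = df.alias("a")
    val b = df.alias("b")

    // Business rules: equality on exactCols, |a - b| <= tolerance on intervalCols.
    val exactMatch = exactCols.map(c => col(s"a.$c") === col(s"b.$c"))
    val inInterval = intervalCols.toSeq.map { case (c, tol) =>
      abs(col(s"a.$c") - col(s"b.$c")) <= tol
    }
    // A row is never its own neighbor.
    val notSelf = col(s"a.$idCol") =!= col(s"b.$idCol")
    val joinCond = (exactMatch ++ inInterval).foldLeft(notSelf)(_ && _)

    // Euclidean distance: square root of the sum of squared differences.
    val distance = sqrt(
      distanceCols.map(c => pow(col(s"a.$c") - col(s"b.$c"), 2.0)).reduce(_ + _)
    )

    // Rank candidates per row by distance; neighbor_id breaks ties deterministically.
    val byRow = Window.partitionBy("id").orderBy(col("distance"), col("neighbor_id"))

    a.join(b, joinCond)
      .select(
        col(s"a.$idCol").as("id"),
        col(s"b.$idCol").as("neighbor_id"),
        distance.as("distance"))
      .withColumn("rank", row_number().over(byRow))
      .where(col("rank") <= k)
  }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder().master("local[*]").appName("similar-rows").getOrCreate()
    import spark.implicits._

    // Made-up sample data, for illustration only.
    val df = Seq(
      (1, "A", 10.0, 1.0, 2.0),
      (2, "A", 11.0, 1.5, 2.5),
      (3, "A", 30.0, 1.1, 2.1),
      (4, "B", 10.0, 1.0, 2.0)
    ).toDF("id", "category", "price", "x", "y")

    findSimilar(df, "id", Seq("category"), Map("price" -> 2.0), Seq("x", "y")).show()
    spark.stop()
  }
}
```

In this sample only rows 1 and 2 satisfy both rules with respect to each other, so they are returned as each other's neighbor. Because the sketch uses an inner join, a row with no valid candidate does not appear in the output at all; a left join would be needed if such rows must be reported. When `exactCols` is non-empty Spark can use the equality predicates as join keys, which keeps the candidate set far smaller than a full cross join.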

Skills: Apache Spark, Scala

About the client:
(0 reviews) Ecully, France

Project ID: #33796299

3 freelancers are willing to do this job for an average of €31


Hi, I have more than 7 years of experience in Hadoop and data mining technologies such as HDFS, MapReduce, Python, PySpark, Scala, and Hive. Please review my profile for my skills and contact me.

€50 EUR in 7 days
(3 reviews)

Okay, I can do this. How can I get this work? Please let me know and I will connect with you. Where can I reach you?

€19 EUR in 7 days
(0 reviews)

I have 3.6 years of experience in Spark, know its APIs in depth, and am well versed in Spark internals and functionality. I am really interested in this task and will be able to complete it ASAP using 3+ years of industr...

€25 EUR in 2 days
(0 reviews)