USA Address Standardization

  • Status: Pending
  • Prize: $1000
  • Entries received: 4

Contest brief

In short, I want to create a standard format for addresses. I have various data sources, each storing address data in different fields, so I want to be able to pull in any address field layout and convert it to my newly created standard format. While doing this, I want to track which step of the code corrected each record, so that I can fine-tune or change that step.
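
The step tracking described above could be sketched roughly as follows. This is only an illustration, not the required design: Python is used for brevity (the brief prefers C or PHP), and the step names `collapse_spaces` and `uppercase`, along with the helpers `standardize` and `summarize`, are hypothetical.

```python
from collections import Counter

def collapse_spaces(addr):
    """Hypothetical step: trim and collapse runs of whitespace."""
    return " ".join(addr.split())

def uppercase(addr):
    """Hypothetical step: uppercase the whole address."""
    return addr.upper()

# Ordered list of (step_name, function). Both steps are assumptions
# chosen for illustration; real steps would come from the data analysis.
STEPS = [("collapse_spaces", collapse_spaces), ("uppercase", uppercase)]

def standardize(addr):
    """Run every step in order.

    Returns the final address and the names of the steps that
    actually changed it, so each correction can be traced to a step.
    """
    changed_by = []
    for name, step in STEPS:
        new_addr = step(addr)
        if new_addr != addr:
            changed_by.append(name)
        addr = new_addr
    return addr, changed_by

def summarize(records):
    """Count how many records each step changed."""
    counts = Counter()
    for rec in records:
        _, changed_by = standardize(rec)
        counts.update(changed_by)
    return counts
```

Because each step is a named entry in one ordered list, a step that produces bad corrections can be tuned or replaced without touching the others, and `summarize` gives the per-step counts.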

Attached are 4 files:

File 1: “street address variation” contains examples of data entered without strict rules, using general fields rather than a specific field for each data type. It is a sample of many different data records that are in fact the same address but do not match.

File 2: FL LEON SITUS FREELANCER is the control data. This file is assumed to be correct and is broken down into each field of a USA address. Although I'm sure it contains errors, it will serve as the basis for creating a standard format.

File 3: FL LEON CERTIFIED DATA, FREE is the file that needs to be compared to the control file. It uses a typical generalized format of 4 address lines for entering a USA address. As File 1 shows, the way this data is entered can vary quite a bit.

File 4: OUTPUT SUMMARY: please look at this one. It has 4 sheets: 1 is the desired output of corrections made, 2 is a version table (not that important), 3 is the db table for the new address along with fields for manual review, and 4 is the street type (street suffix) abbreviations.

Various data sources collect data in different formats. The task is to take the control data (File 2), make a general format out of it, then build a combined format from address lines 1-4 in File 3 and compare the two to find matches. First, develop a basic manipulation of the data that gets the highest rate of direct matches with File 3. Then start testing changes to the data in File 3 to make it match the newly created format of the data in File 2, starting with the simplest changes. For example, the street type “c ir” should be changed to “CIR”. This is a simple change with limited possibilities.
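
A rough sketch of the "direct match first, then simplest change" order might look like this. It is an assumption-heavy illustration: the helper names `combine_lines`, `fix_split_cir` and `match_step` are made up, only the street line is compared (city, state and ZIP are ignored), and the control keys are assumed to be built with the same `combine_lines` function.

```python
import re

def combine_lines(*lines):
    """Join address lines into one uppercase string with single spaces.

    Empty or missing lines are skipped.
    """
    joined = " ".join(line for line in lines if line)
    return " ".join(joined.upper().split())

def fix_split_cir(addr):
    """Rejoin the street type "C IR" into "CIR".

    Word boundaries keep strings such as "ABC IR" or "C IRON" untouched.
    """
    return re.sub(r"\bC IR\b", "CIR", addr)

def match_step(lines, control):
    """Try a direct match, then the simplest correction.

    `control` is a set of keys built from the control file.
    Returns (key, step_name); step_name is None when nothing matched.
    """
    key = combine_lines(*lines)
    if key in control:
        return key, "direct"
    fixed = fix_split_cir(key)
    if fixed in control:
        return fixed, "fix_split_cir"
    return key, None
```

With a control set holding a made-up key such as `combine_lines("100", "witchtree", "cir")`, calling `match_step(["100 Witchtree c ir", "", "", ""], control)` reports the step `fix_split_cir`, while an address already in standard form reports `direct`.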

Example of differing street types: “witchtree acrs” and “witchtree acres” are the same.
Examples of Situs corrections to be made: there are 4502 records that have no street name info, and these records can be left alone; in record 21435 (adrno = 4085, adrstr = buck lake), the “rd” is in adrsuf2 and should be in adrsuf; record 133657 has street type “c ir” instead of “cir”.
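
The Situs corrections listed above could be sketched along these lines. The field names `adrstr`, `adrsuf` and `adrsuf2` come from the brief; everything else is an assumption: the variant table is a tiny hypothetical stand-in for sheet 4 of File 4, the choice of "ACRS" as the standard form is arbitrary, and the suffix is moved only when `adrsuf` is empty.

```python
# Hypothetical variant table; the real one would be loaded from
# sheet 4 (street suffix abbreviations) of File 4. Using "ACRS" as
# the standard form is an assumption.
SUFFIX_VARIANTS = {"ACRES": "ACRS", "C IR": "CIR"}

def field(rec, name):
    """Return a field as a stripped string, treating missing/None as empty."""
    return (rec.get(name) or "").strip()

def normalize_suffix(value):
    """Uppercase, collapse spaces, then map known variants."""
    cleaned = " ".join(value.upper().split())
    return SUFFIX_VARIANTS.get(cleaned, cleaned)

def fix_situs_record(rec):
    """Return (corrected copy, names of the steps that changed it)."""
    out = dict(rec)
    steps = []
    # Records without a street name are left alone, as the brief asks.
    if not field(out, "adrstr"):
        return out, steps
    # Move a suffix found in adrsuf2 into adrsuf, but only when
    # adrsuf is empty (an assumption, to avoid overwriting data).
    if not field(out, "adrsuf") and field(out, "adrsuf2"):
        out["adrsuf"], out["adrsuf2"] = out["adrsuf2"], ""
        steps.append("move_adrsuf2_to_adrsuf")
    normalized = normalize_suffix(field(out, "adrsuf"))
    if normalized != out.get("adrsuf"):
        out["adrsuf"] = normalized
        steps.append("normalize_suffix")
    return out, steps
```

For a record shaped like 21435 (`adrstr` "buck lake", empty `adrsuf`, "rd" in `adrsuf2`), this sketch moves the suffix and then uppercases it, reporting both step names so the correction can be traced.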

Please feel free to ask questions if you need clarification. I prefer C or PHP, with MySQL.

Public clarification board

  • BUZIO66
    • 2 weeks ago

    I am a bit confused, so I will try to clarify my previous question by reformulating the project's goal.

    Target: create a standard format for US addresses (at the moment I suppose the best candidate is the format proposed in "New exit address", so I need to know exactly what its fields mean).
    Namely: starting from different data sources, the goal is to create a batch process (in some language) that transforms the starting data into the adopted standard format.

    Quality control: whenever the current batch process fails to generate an acceptable output, I need to be able to determine the change to make to solve the problem.

    Correct?
  • BUZIO66
    • 3 weeks ago

    Hi, can you give me an example of how to populate the "New exit address" tab?
  • QualityComponent
    • 3 weeks ago

    Hey, I have been working with Excel for many years, and I think it is difficult to build the standard without a detailed discussion. I suggest we discuss it and you give me a short walkthrough of the sheets and data; that will help us produce output that won't be rejected.

    1. allygoood
      Contest holder
      • 3 weeks ago

      Please make a dummy entry so we can chat.
  • hellocodepolicy
    • 3 weeks ago

    Hey, can we discuss this? We need a short walkthrough of the sheets and data, because it is difficult to build the standard without a detailed discussion. It will help us produce output that won't be rejected.

    1. allygoood
      Contest holder
      • 3 weeks ago

      Please make a dummy entry so we can chat.
  • allygoood
    Contest holder
    • 4 weeks ago

    I encourage you to contact me with questions. The submission should be a summary of records (corrected overall, not matched, etc.) along with which step of the algorithm corrected them, a brief overall explanation of how you went about solving the task, and a brief description of each step of the standardizing process.
  • yhf8377
    • 4 weeks ago

    Just wondering, what do you want in the submission image? A screenshot of the working program, a screenshot of the end results, or something else?
  • allygoood
    Contest holder
    • 1 month ago

    If anybody would like to chat, please make a dummy entry and ask me to chat. Thank you.

    1. phjocoronel
      • 1 month ago

      Hi, do you speak English or Spanish?
  • ABlackthorn
    • 1 month ago

    Does the project have to be written in C? Or can it be in C++?

    1. allygoood
      Contest holder
      • 1 month ago

      C++ is OK.
  • vw7341434vw
    • 1 month ago

    Can it be assumed that ADDRESS1-4 are always the address fields? Can the number of entries per line be assumed < 26? Length of character strings < 64 characters? For C, is a command-line program that takes the input and output file names okay? Are the examples for address fractions and ranges (e.g. STE, SUITE) complete? I see CT and WAY etc. Can rules be made to limit the possibilities? Does the output file need to be sorted, and if so, by what field?
  • csa578e9eda66603
    • 1 month ago

    Hello. The C programming language is ideal for console applications without a graphical interface. PHP is ideal only for web applications. For desktop applications with a graphical interface, Java, C#, VB or Access are ideal.
    Question 1. Do you want a desktop application with a graphical interface, a console application, or a web application?
    I understand that I must read and verify the FL LEON Certified_Data FREE.csv file, then generate a new file, FL LEON Certified_Data FREE-Verified.csv, with the necessary corrections.
    To produce this new FL LEON Certified_Data FREE-Verified.csv file, I must use the STREET ADDRESS VARIATION.xlsx file and the FL LEON SITUS FREELANCER.xlsx file to verify that the correct "word", and not one of its variations, was used.
    Question 2. Am I correct?
  • ABlackthorn
    • 1 month ago

    Hello, this seems easily doable; I just don't fully understand everything that is asked in task 1.
    You want a program that takes input from the 2 FL LEON files and consolidates them into one file, taking into account the variations that addresses may have? Is that it?

    1. allygoood
      Contest holder
      • 1 month ago

      Please re-read, and ask if your questions are not answered.
  • UGINTL
    • 1 month ago

    A little more description would be helpful, especially about the attached files. You should be specific about what each file does and what to do with it, using the names of the files.
  • allygoood
    Contest holder
    • 1 month ago

    The task is to standardize the addresses. Situs is the control and is assumed to be correct; the other file is to be manipulated to get a high degree of match to the control file.
