ENTRY DATA SCIENTIST

The Opportunity:

Croptix headquartered in State College, PA, offers farmers a practical, affordable crop disease screening solution that “sees” the symptoms of plant disease inside of leaves before those symptoms are visible to the human eye and before disease has time to spread. The services leverage advanced analytics coupled with field deployable sensor technology to equip farmers with the tools to diagnose crop disease early and take effective action to reduce losses due to disease. Currently Croptix is focused on providing a service solution to citrus farmers dealing with devastating citrus diseases including citrus HuangLongBing disease, which has no cure and has plagued citrus production globally.

 

Croptix is growing fast and looking for new team members who are passionate about bringing technology solutions to agriculture to improve and protect our food production. If you are passionate about protecting our food crops, you may be a good fit for our team. We need creative problem solvers with strong project management skills who meet deadlines while working in a fast-paced, dynamic environment.

Croptix is looking to immediately hire a full-time entry level Data Scientist to join our team. This is an entry level data team position with room to learn and grow as Croptix grows. The Data Scientist is primarily tasked with improving current Croptix production data models and researching and building new models based on latest ML and AI techniques. Secondary tasks include improving the data preprocessing pipeline through development of new tools or algorithms. The Data Scientist will work closely with other team members and will make significant contributions to the development and operation of the Croptix platform. 

 

If you are excited about the position and feel that you could bring valuable experience to our team, we encourage you to apply.

Job Duties and Responsibilities:

  • Working on independent projects as part of a data analytics team.

  • Building advanced statistical and machine learning models to help improve our disease-classifying algorithms. 

  • Developing tools to help us process data more efficiently.

Minimum Requirements-Education, Skills and Qualifications:

  • B.S. degree in Math, Statistics or equivalent is a minimum education entry requirement; M.S./M.E. is preferred.

  • Python programming experience is a must (numpy, pandas, scikit-learn)

  • Familiarity with Jupyter, GitHub, Slack, Gmail, Google Calendar, Google Drive

  • Must have knowledge of statistics fundamentals: (generalized) linear models, mixed models, outlier detection

  • Must have experience cleaning, massaging, and organizing data for analysis.

  • Must have experience developing and applying statistical analysis including linear regression, principal components analysis, linear discriminant analysis

  • Experience implementing other statistical and ML models such as logistic regression, random forest, XGBoost

  • Communication skills: comfortable communicating through virtual presentations, in person meetings, and general conversations

  • Must demonstrate good organization/documentation practices with regards to annotating code, producing Jupyter notebooks, data team wiki-pages, task management, debugging, and issue ticketing.

  • Must be able to produce high quality and clearly communicated slide decks and other documents for team meetings and review

Preferred Additional Experience:

  • 1-3 years of work experience as a Data Analyst, Data Engineer, Data Scientist or similar

  • Familiarity with MySQL

  • Experience building Python packages

  • Experience working with spectral data, image processing, and/or signal processing

  • Experience conducting causality experiments by applying A/B experiments or other similar approaches to identify underlying problems of an observed result