The Opportunity:

Croptix, headquartered in State College, PA, provides crop disease detection services to the agriculture industry. The services combine advanced analytics with field-deployable sensor technology to equip farmers with the tools to diagnose crop disease early and take effective action to reduce losses. The Croptix platform is available for citrus growers seeking to identify and manage the devastating citrus greening disease.


Croptix is looking to immediately hire a full-time, entry-level Data Scientist to join our data team, with room to learn and grow as Croptix grows. The Data Scientist is primarily tasked with improving current Croptix production data models and with researching and building new models based on the latest ML and AI techniques. Secondary tasks include improving the data preprocessing pipeline by developing new tools or algorithms. The Data Scientist will work closely with other team members and will make significant contributions to the development and operation of the Croptix platform.


The qualifications listed below give a high-level overview of what we’re looking for in a Data Scientist. The position will start as remote due to the impact of COVID-19, with the opportunity to continue working remotely after the pandemic. If you are excited about the position and feel that you could bring valuable experience to our team, we encourage you to apply.

Job Duties and Responsibilities:

  • Working on independent projects as part of a data analytics team.

  • Building advanced statistical and machine learning models to help improve our disease-classifying algorithms. 

  • Developing tools to help us process data more efficiently.

Minimum Requirements - Education, Skills, and Qualifications:

  • A B.S. degree in Math, Statistics, or an equivalent field is the minimum education requirement; an M.S./M.E. is preferred.

  • Python programming experience (NumPy, pandas, scikit-learn) is a must

  • Familiarity with Jupyter, GitHub, Slack, Gmail, Google Calendar, Google Drive

  • Must have knowledge of statistics fundamentals: (generalized) linear models, mixed models, outlier detection

  • Must have experience cleaning, massaging, and organizing data for analysis.

  • Must have experience developing and applying statistical analyses, including linear regression, principal component analysis, and linear discriminant analysis

  • Experience implementing other statistical and ML models such as logistic regression, random forest, and XGBoost

  • Communication skills: comfortable communicating through virtual presentations, in-person meetings, and general conversations

  • Must demonstrate good organization and documentation practices with regard to annotating code, producing Jupyter notebooks, maintaining data team wiki pages, task management, debugging, and issue ticketing.

  • Must be able to produce high-quality, clearly communicated slide decks and other documents for team meetings and review

Preferred Additional Experience:

  • 1-3 years of work experience as a Data Analyst, Data Engineer, Data Scientist or similar

  • Familiarity with MySQL

  • Experience building Python packages

  • Experience working with spectral data, image processing, and/or signal processing

  • Experience conducting causal experiments, such as A/B tests or similar approaches, to identify the underlying problems behind an observed result