Middle Data Scientist
Прямой работодатель WaveAccess ( waveaccess.ru )
Опыт работы от 3 до 5 лет
WaveAccess is looking for a Data Scientist to join our team and work on diverse projects for international clients. You can solve real-world business problems using various data science techniques, modern technology stacks, and advanced methodologies.
Responsibilities:
- Work with multiple data sources - collect, clean, analyze, and interpret data to provide valuable insights (incl. semi-structured document data)
- Build end-to-end data processing pipelines in Dataiku (datasets, recipes, scenarios) with reproducible results
- Extract and normalize information from documents - parse PDF and DOCX, handle noisy layouts, tables, and mixed structures
- Formulate and defend hypotheses to address complex business challenges; run offline experiments and interpret results
- Develop and implement data-driven solutions using Python tools;
- Integrate lightweight LLM prompting when it improves quality;
- Implement agentic solutions
- Collaborate closely with cross-functional teams to understand client requirements and deliver impactful results
- Present achieved results clearly - metrics, limitations, next steps, business value
Requirements:
- At least 3 years of commercial development experience
- English - B2 or higher (Upper Intermediate)
- Proficiency in Python and practical experience with pandas and other standard data science tools
- Experience with LLM/RAG (pragmatic, lightweight implementations; evaluation and safety/quality checks) and Agentic systems
- Experience with document/text data processing in applied tasks (PDF/DOCX extraction, normalization)
- Experience in presenting achieved results (both technical and business stakeholders)
- Familiarity with classic machine-learning approaches and algorithms (scikit-learn)
Technical Skills:
- Python
- LLM/RAG
- LangChain (light usage - prompt orchestration / calling LLMs)
- Agentic systems (tool/pipeline orchestration patterns)
- pandas
- scikit-learn (sklearn)
- Classic NLP stack
- Basic SQL
Preferred Experience:
- Experience with production workflows in Dataiku (scenarios, automation, monitoring, packaging)
- Experience using data visualization tools to communicate insights effectively (Python or Dataiku)
- Familiarity with deployment and version control tools like Docker and Git
- Experience working in Agile development environments
What We Offer:
- Work in a dynamic international team
- Employment according to labor law, 100% payment for sick leave and vacation
- Opportunity for cooperation through individual entrepreneurship/self-employment
- Participation in foreign and Russian projects
- Health insurance with dental coverage
- Necessary equipment for work
- Corporate training programs
- Broad opportunities for self-realization, professional and career growth
- Democratic approach to processes and flexible start of the workday
