About the company
The Binance Exchange is a leading cryptocurrency exchange founded in 2017 in Hong Kong. It features a strong focus on altcoin trading. Binance offers crypto-to-crypto trading in more than 600 cryptocurrencies and virtual tokens, including Bitcoin (BTC), Ether (ETH), Litecoin (LTC), Dogecoin (DOGE), and its own token Binance Coin (BNB).
Job Summary
Responsibilitiesļ¼
šUtilize NLP techniques to preprocess, analyze, and extract insights from large textual datasets. Develop and implement NLP models to derive actionable insights and enhance business decision-making processes. šDesign, develop, and evaluate complex data models to support statistical analysis, machine learning, and other data-driven tasks. šEnsure data models are robust, scalable, and optimized for performance. šPerform data cleaning, transformation, and preprocessing to create high-quality datasets for analysis and modeling. šConduct exploratory data analysis to uncover patterns, trends, and relationships within the data. Generate visualizations and summaries to communicate findings to stakeholders. šDevelop and apply feature engineering techniques to create meaningful features that improve the performance of models. This includes deriving new features from raw data, selecting relevant features, and transforming existing features.
Requirements:
šProficient in designing, developing, and evaluating complex data models. Familiarity with statistical analysis and machine learning frameworks. šDeep understanding of modern machine learning techniques and mathematical underpinning, such as classifications, neural networks, hyperparameter optimization, etc. šStrong knowledge and experience in NLP techniques and tools for analyzing and extracting insights from textual data. šSolid understanding and practical experience with deep learning architectures, including transformer models (e.g., BERT, GPT). Ability to implement and optimize these models for various tasks. šProficiency in programming languages such as Python, R, or similar. šExperience with libraries and frameworks such as TensorFlow, PyTorch, Keras, and Scikit-learn. šDemonstrated experience in handling severely imbalanced datasets. šKnowledge of techniques and strategies to address imbalances in data. šHolds a Master's degree or higher in Computer Science, Data Science, Statistics, Mathematics, Computational Linguistics, or a related field. Current Master's or Ph.D. students are welcome to apply.