科研与比赛经历 Research & Competition Experience

具身人工智能与机器人实验室 Embodied AI & Robotics Lab

研究助理 Research Assistant

2024年5月 - 至今 May 2024 - Present

纽约大学阿布扎比分校 NYU Abu Dhabi

研究内容 Research Focus

  • 收集并处理了13种手语数据集,用于双向手语翻译 Collected and processed sign language datasets in 13 languages for bidirectional sign language translation
  • 使用YouTube API和基于Python的自动化工具,从YouTube上收集了约2000个手语视频。这个GitHub仓库 展示了一个下载ASL(美国手语)视频的实例代码。 Collected approximately 2000 sign language videos from YouTube using YouTube API and Python-based automation tools. This GitHub repository demonstrates example code for downloading ASL (American Sign Language) videos.

Jane Street 实时市场数据预测 Jane Street Market Data Prediction

Kaggle(全球) Kaggle (Global)

团队成员,分析师 Team Member, Analyst

2024年10月 - 2024年1月7日 October 2024 - January 7, 2024

在全球3713支队伍中排名前4%(第143名),获得银牌 Ranked top 4% (143rd) among 3,713 global teams, earned Silver Medal

主要成果 Key Achievements

  • 分析了包含47,127,338行样本的大规模数据集,涵盖1,699个交易日(超过四年半)的市场数据。数据集包含79个特征,代表不同金融指标。这个GitHub仓库 包含了完整的代码实现,包括特征工程、模型训练和优化过程。 Analyzed a large-scale dataset containing 47,127,338 samples across 1,699 trading days (over 4.5 years). The dataset includes 79 features representing various financial indicators. This GitHub repository contains the complete code implementation, including feature engineering, model training, and optimization process.
  • 应用K折交叉验证并优化LGBM模型,将预测准确率提高了10% Applied K-fold cross-validation and optimized LGBM model, improving prediction accuracy by 10%
  • 通过超参数调优和稳健验证,将预测得分从0.0096提升至0.0083 Enhanced prediction score from 0.0096 to 0.0083 through hyperparameter tuning and robust validation