arkin.gupta@gmail.com

I am a senior at UC San Diego pursuing a Bachelor of Science in mathematics and computer science, focusing on machine learning and quantitative research. I'll be joining BlackRock as an analyst in July 2019.

Last summer I worked at BlackRock as a summer analyst on their ETF & Index Investments research team working on natural language processing. At UC San Diego, I am a TA/Tutor for the Computer Science & Engineering department and an undergraduate researcher at the Mathematical Neuroscience lab . Prior to that, I interned at CareFusion BD as a data scientist working on time-series forecasting and machine learning models.

Team: ETF & Index Investments Global Research and Analytics

Developed time-series forecasting, machine learning models to predict drug shortages, and price changes. Effectively analyzed and visualized datatsets with more than 10 Million drug usage and transaction records. Used dimension-reduction techniques (PCA, SVD, LDA), fourier and log transformations and resampling techniques (bagging, boosting) to identify correlations between variables and extract underlying patterns of the data. Worked with multiple regression and classificatoin models - linear models, ARIMA, boosted trees (xgboost), random forests and SVMs.

Tutor for Object-Oriented Programming (CSE 11) and Data Structures (CSE 12). Worked with the instructor to design and write programming assignments and their specifications. Held office hours and led review sessions to explain programming concepts and assist students in implementing programming assignments by analyzing and debugging their code. Graded homework, exams and wrote submission/grading scripts for programming assignments.

Wrote python scripts to setup a Continuous Integration server to automate package builds. Developed multiple native Linux (Ubuntu, CentOS, Debian) packages using bash and python for Kolibri - Learning Equality's flagship application. Optimized software setup on all platforms by implementing efficient installation scripts.

Developed a user-friendly, sustainable web-application for processing and storage of archaeological data. Implemented multiple features in Ruby on Rails, including auto-complete site search and backups. Worked extensively with legacy code, debugged JavaScript bugs and re-factored the SQL database.

My reserach interests include machine learning, neural networks and investing strategies. In particular, I'm interested in research applicable to finance (buy-side), macroeconomics and neuroscience.

Developing neural derived and neuro-mimetic machine learning algorithms. Constructing complex and dynamic artificial neural networks by incorporating neural features such as propagation decay, geometric information and refractory periods. Writing python code to generate and train such neural networks, run experiments and analyze results. Collaborating with Microsoft’s Special Projects division.

Used underwater sound pressure from active and passive sources to train ML models for various applications. Developed deep spatio-temporal (Convolutional LSTM) neural networks to predict ship paths.

Open-source, group projects I worked on for courses and at hackathons.

Support vector machines (SVMs) are an extremely powerful machine learning tool to solve various classification problems. Not only are they less prone to over-fitting due to large margins, but they are also easy to optimize due to their convex nature. In this paper we will review both soft and hard margin formulations of linear SVMs. First, we discuss how to solve soft-margin SVMs via dual formulation, and justify how the dual problem will in-fact give the optimal solution of primal form. Then, we discuss kernel tricks to solve non-linear classification using convex optimization. Finally, we perform classification on real-world data using both non-linear and linear SVMs using the algorithms devised prior.

Analysis of the negative effects of Gentrification in San Diego in the 21st Century. Visualized the change in demographics of all neighborhoods in San Diego using heat maps. Identified neighborhoods effected the most by Gentrification and found patterns between multiple socio-economic factors such as Poverty, Population, Uninsurance and Property value. Languages/Tools Used: Python (Pandas, NumPy, Matplotlib, Patsy), Jupyter Notebooks.

Data science powered web application to perform sentiment analysis on YouTube comments. Applied machine learning techniques on the model using a training dataset of 1 Million tweets. Wrote python scripts for web scraping and performing sentiment analysis on the comments. Languages/Tools Used: Python, Natural Language Toolkit, Flask.

Android application to ease the process of connecting with people on multiple social media. Integrated database, added Location tracking and developed the app structure. Languages/Tools Used: Java (Android), XML, Google Firebase.

Mobile app to unify the process of connecting with people on multiple social media. Integrated the database, developed Location tracking functionality, and designed the app structure. Languages/Tools Used: JavaScript, HTML/CSS, MongoDB

*UC San Diego, Winter 2019*

*UC San Diego, Winter 2018*

*UC San Diego, Spring 2017*

*Projects Chair*

*Online Content Lead*

- ECE 273 - Convex Optimization (Graduate) CSE 250B - Learning Algorithms (Graduate)
- MATH 180A - Probablity
- MATH 180BC - Stochastic Processes I & II
- MATH 181AB - Mathematical Statistics I & II
- MATH 100ABC - Abstract Algebra I, II, III
- MATH 184 - Combinatorics
- MATH 170A - Numerical Linear Algebra
- MATH 109 - Mathematical Reasoning
- MATH 20D - Ordinary Differential Equations
- MATH 20CE - Multivariable & Vector Calculus
- MATH 20AB - Calculus I & II
- MATH 18 - Linear Algebra
- ECON 109T - Advanced Topics in Game Theory
- ECON 109 - Game Theory
- COGS 108 - Data Science in Practice
- MGT 183 - Financial Investments
- CSE 158 - Recommender Systems & Data Mining
- CSE 105 - Theory of Computation
- CSE 101 - Design and Analysis of Algorithms
- CSE 100 - Advanced Data Structures
- CSE 30 - Computer Organization and Systems Programming
- CSE 20 - Discrete Mathematics
- CSE 15L - Software Tools & Techniques
- CSE 12 - Data Strcutures & OOP Design
- CSE 11 - Object-Oriented Programming

Website cloned from here.