Jaydip Sen

and 6 more

In various real-world applications such as machine translation, sentiment analysis, and question answering, a pivotal role is played by NLP models, facilitating efficient communication and decision-making processes in domains ranging from healthcare to finance. However, a significant challenge is posed to the robustness of these natural language processing models by text adversarial attacks. These attacks involve the deliberate manipulation of input text to mislead the predictions of the model while maintaining human interpretability. Despite the remarkable performance achieved by state-of-the-art models like BERT in various natural language processing tasks, they are found to remain vulnerable to adversarial perturbations in the input text. In addressing the vulnerability of text classifiers to adversarial attacks, three distinct attack mechanisms are explored in this paper using the victim model BERT: BERT-on-BERT attack, PWWS attack, and Fraud Bargain’s Attack (FBA). Leveraging the IMDB, AG News, and SST2 datasets, a thorough comparative analysis is conducted to assess the effectiveness of these attacks on the BERT classifier model. It is revealed by the analysis that PWWS emerges as the most potent adversary, consistently outperforming other methods across multiple evaluation scenarios, thereby emphasizing its efficacy in generating adversarial examples for text classification. Through comprehensive experimentation, the performance of these attacks is assessed, and the findings indicate that the PWWS attack outperforms others, demonstrating lower runtime, higher accuracy, and favorable semantic similarity scores. The key insight of this paper lies in the assessment of the relative performances of three prevalent state-of-the-art attack mechanisms.

Jaydip Sen

and 1 more

Jaydip Sen

and 2 more

Prediction of stock prices has been an important area of research for a long time. While supporters of the efficient market hypothesis believe that it is impossible to predict stock prices accurately, there are formal propositions demonstrating that accurate modeling and designing of appropriate variables may lead to models using which stock prices and stock price movement patterns can be very accurately predicted. Researchers have also worked on technical analysis of stocks with a goal of identifying patterns in the stock price movements using advanced data mining techniques. In this work, we propose an approach of hybrid modeling for stock price prediction building different machine learning and deep learning-based models. For the purpose of our study, we have used NIFTY 50 index values of the National Stock Exchange (NSE) of India, during the period December 29, 2014 till July 31, 2020. We have built eight regression models using the training data that consisted of NIFTY 50 index records from December 29, 2014 till December 28, 2018. Using these regression models, we predicted the open values of NIFTY 50 for the period December 31, 2018 till July 31, 2020. We, then, augment the predictive power of our forecasting framework by building four deep learning-based regression models using long-and short-term memory (LSTM) networks with a novel approach of walk-forward validation. Using the grid-searching technique, the hyperparameters of the LSTM models are optimized so that it is ensured that validation losses stabilize with the increasing number of epochs, and the convergence of the validation accuracy is achieved. We exploit the power of LSTM regression models in forecasting the future NIFTY 50 open values using four different models that differ in their architecture and in the structure of their input data. Extensive results are presented on various metrics for all the regression models. The results clearly indicate that the LSTM-based univariate model that uses one-week prior data as input for predicting the next week’s open value of the NIFTY 50 time series is the most accurate model.

Jaydip Sen

and 1 more

Prediction of future movement of stock prices has been a subject matter of many research work. There is a gamut of literature of technical analysis of stock prices where the objective is to identify patterns in stock price movements and derive profit from it. Improving the prediction accuracy remains the single most challenge in this area of research. We propose a hybrid approach for stock price movement prediction using machine learning, deep learning, and natural language processing. We select the NIFTY 50 index values of the National Stock Exchange (NSE) of India, and collect its daily price movement over a period of three years (2015 – 2017). Based on the data of 2015 – 2017, we build various predictive models using machine learning, and then use those models to predict the closing value of NIFTY 50 for the period January 2018 till June 2019 with a prediction horizon of one week. For predicting the price movement patterns, we use a number of classification techniques, while for predicting the actual closing price of the stock, various regression models have been used. We also build a Long and Short-Term Memory (LSTM)-based deep learning network for predicting the closing price of the stocks and compare the prediction accuracies of the machine learning models with the LSTM model. We further augment the predictive model by integrating a sentiment analysis module on Twitter data to correlate the public sentiment of stock prices with the market sentiment. This has been done using Twitter sentiment and previous week closing values to predict stock price movement for the next week. We tested our proposed scheme using a cross validation method based on Self Organizing Fuzzy Neural Networks (SOFNN) and found extremely interesting results.

Aayush Sahu

and 6 more

This work presents a cointegration-based pair-trading strategy for identifying stock pairs with substantial cointegration in their prices across four years (January 1, 2018, to December 31, 2021). After the Cointegrated pairs are determined, pair-trading portfolios are created, and portfolio performance is tracked during a one-year test period (January 1, 2022, to December 31, 2021). Suitable trigger points are determined utilizing a very powerful spread detection system, allowing both stocks’ short and long positions to be precisely recognized. The yearly return of a portfolio is used to assess its performance. First, twelve sectors of stocks from the National Stock Exchange (NSE) of India are selected. According to the NSE’s monthly report for the month of December 31, 2022, the top ten stocks in terms of their free-float market capitalization from the twelve sectors are selected. Pair trading portfolios are built using pairs from each sector that demonstrated cointegration of close prices from January 1, 2018, to December 31, 2021. The portfolios are evaluated based on their return from January 1, 2022, to December 31, 2022. Furthermore, clustering-based techniques were used to identify stocks that behaved similarly for the period of four years i.e., January 1, 2018, to December 31, 2021. This was performed by using three clustering techniques and the best technique, based on their respective results, was chosen to identify the clusters to verify the cointegrated pairs. Henceforth, the pairs which were common in both cointegration, and clustering techniques were regarded as the most recommended pairs for trading. The work makes three distinct contributions. First, the paper provides a cointegration-based pair trading strategy for stock portfolio creation, that can be used to earn profit by the investors in the stock market. Second, the pair-trading models are trained and tested on real-world stock market data, with the results displayed to illustrate the models’ efficacy. Finally, since the stocks utilized in the pair trading portfolio designs are drawn from various NSE sectors, the outcomes of the pairings are an excellent signal of the possible profit that investors could make if they invest in those sectors using the recommended pair-trading technique.